IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Application of 
BLONDELET-ROUAULT, et al. 

Application No.: 10/680,860 

Filed: October 7, ^[©3 

Title: POLYPEPTIDES INVOLVED IN THE 
BIOSYNTHESIS OF SPIRAMYCINS, 
NUCLEOTIDE SEQUENCE ENCODING 
THESE POLYPEPTIDES AND 
APPLICATIONS THEREOF 



Examiner: Not Yet Known 



Group Art Unit.: Not Yet Known 



Cei 

I hereby certify | 
with the United 
for first class 
Patent Appli 
transmitted to the 1 
Date of Deposij 
Printed Name 



Icate 



of Mailing or Transmission 
t this correspondence is being deposited 
tes Postal Service with sufficient postage 
in an envelope addressed to the Mail Stop 
m address below, or being facsimile 
SPTO, on the date indicated below. 

September 20, 2004 

ftcate Jonas Pierre, Sr. 



Signature 



Mail Stop Patent Application 
Commissioner for Patents 
P. O. Box 1450 
Alexandria, VA 22313-1450 



SUBMISSION AND REQUEST FOR ENTRY 
OF VERIFIED TRANSLATION OF NON-ENGLISH LANGUAGE PAPERS 



Applicants submit herewith an English translation of U.S. Non-Provisional Application No. 
10/680,860 filed October 7, 2003. Also submitted herewith is a statement by the translator of the 
accuracy of the translation. 

This submission and request for entry is being made to satisfy the requirements under 
37 C.F.R. 1.78(a)(5). 

The Commissioner is hereby authorized to charge any additional fees which may be required 
by this paper, or credit any overpayment, to Account No. 18-1982. Two duplicate copies of this sheet 
are enclosed. 



Aventis Pharmaceuticals Inc. 
Patent Department 
Route #202-206 / P.O. Box 6800 
Bridgewater, New Jersey 08807-0800 
Telephone (908)231-3737 
Telefax (908)231-2626 



Respectfully submitted, 




F. Aaron Dubberley, Reg. No., 4^,001 
Attorney/Agent for Applicant 



Aventis Docket No. FRAV2002/0028 US NP 



UNITED STATES PATENT AND TRADEMARK OFFICE 



I, Susan ANTHONY BA, ACIS, 
Director of RWS Group Ltd, of Europa House, Marsham Way, Gerrards Cross, 
Buckinghamshire, England declare; 

1. That I am a citizen of the United Kingdom of Great Britain and Northern Ireland. 

2. That the translator responsible for the attached translation is well acquainted with the 
German and English languages. 

3. That the attached is, to the best of RWS Group Ltd knowledge and belief, a true 
translation into the English language of the specification in French filed with the application 
for a patent in the U.S.A. on October 7, 2003 

under the number 10/680,860 

4. That I believe that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are true; and further that these statements were 
made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and 
that such willful false statements may jeopardize the validity of the patent application in the 
United States of America or any patent issuing thereon. 



For and on behalf of RWS Group Ltd 
The 14th day of September 2004 



1 




POLYPEPTIDES INVOLVED IN THE BIOSYNTHESIS OF SPIRAMYCINS. 
NUCLEOTI DE SEQUENCES ENCODING THESE POLYPEPTIDES AND 

APPLICATIONS THEREOF 



5 The present invention relates to the isolation and identification of novel genes of 

the biosynthetic c pathway for spiramycins and to novel polypeptides involved in this 
biosynthesis. It also relates to the use of these genes for the purpose of increasing the 
levels of production and the purity of the spiramycin produced. 

The invention also relates to the use of these genes for constructing mutants which 
10 can lead to the synthesis of novel antibiotics or to derived forms of spiramycins. The 
invention also relates to the molecules produced through the expression of these genes 
and, finally, to pharmacologically active compositions of a molecule produced through 
the expression of such genes. 

Streptomyces are Gram-positive filamentous soil bacteria. They play an important 
15 role in the decomposition and mineralization of organic materials due to the great 
diversity of degrading enzymes that they secrete. They exhibit morphological 
differentiation phenomena which are unique in prokaryotes, accompanied by a metabolic 
differentiation characterized by the production of secondary metabolites having an 
extraordinary diversity of chemical structures and biological activities. Among these 
20 metabolites are the spiramycins produced naturally by the bacterium Streptomyces 
ambofaciens. 

Spiramycin is an antibiotic of the macrolide family which is of use both in 
veterinary medicine and in human medicine. Macrolides are characterized by the 
presence of a lactone ring onto which are grafted one or more sugars. Streptomyces 
25 ambofaciens naturally produces spiramycin I, II and III (cf. figure 1); however, the 
antibiotic activity of spiramycin I is clearly greater than that of spiramycins II and III 
(Liu et al. 9 1999). The spiramycin I molecule consists of a lactone-based macrocycle, 
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called platenolide, and three sugars: forosamine, mycaminose and mycarose 
(cf. figure 1). The antibiotic activity of spiramycins is due to inhibition of protein 
synthesis in prokaryotes via a mechanism involving binding of the antibiotic to the 
bacterial ribosome. 

5 Certain compounds which are members of the macrolide family and which also 

possess a lactone ring have given rise to uses outside the field of antibiotics. Thus, the 
product FK506 has immunosuppressor effects and offers perspectives of therapeutic 
application in the field of organ transplantation, of rheumatoid arthritis and, more 
generally, in pathological conditions related to autoimmune reactions. Other macrolides, 
10 such as avermectin, have insecticidal and anti -helminth activity. 

Many biosynthetic pathways, concerning antibiotics belonging to varied classes, 
and also other secondary metabolites (for a review, K, Chater, 1990), have to date 
already been studied in Streptomycetes. However, knowledge of the biosynthetic 
pathways for spiramycins is, to date, only very partial. 

15 Spiramycin biosynthesis is a complex process comprising many steps and 

involving many enzymes (Omura et al. 9 1979a, Omura et al., 1979b). Spiramycins 
belong to the large class of polyketides which includes complex molecules which are 
particularly abundant in microorganisms found in the soil. These molecules are grouped 
together not through structural analogy, but through a certain similarity in the steps of 

20 their biosynthetic pathway. Specifically, polyketides are produced by a complex series of 
reactions, which have in common the fact that, in their biosynthetic pathway, there is a 
series of reactions catalyzed by one or more enzymes called "polyketide synthases" 
(PKS). In Streptomyces ambofaciens, the biosynthesis of the lactone-based macrocycle 
of spiramycins (platenolide) is carried out by a series of eight modules encoded by five 

25 PKS genes (S. Kuhstoss, 1996, US patent 5,945,320). Spiramycins are obtained from 
this lactone ring. However, the various steps and enzymes involved in the synthesis of 
the sugars, and also their attachment to the lactone ring and the modification of this ring 
so as to obtain spiramycins still remain unknown to date. 
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US patent 5,514,544 describes the cloning of a sequence called srmR in 
Streptomyces ambofaciens. In that patent, the hypothesis that the srmR gene encodes a 
protein which regulates the transcription of the genes involved in macrolide biosynthesis 
is put forward. 

5 In 1987, Richardson and colleagues (Richardson et aL, 1987) showed that the 

spiramycin resistance of S. ambofaciens is imparted by at least three genes; said genes 
were called srmA, srmB and srmC. US patent 4,886,757 describes more particularly the 
characterization of a DNA fragment of S. ambofaciens containing the srmC gene. 
However, the sequence of this gene was not disclosed. In 1990, Richardson and 
10 colleagues (Richardson et aL, 1990) put forward the hypothesis that there are three genes 
for spiramycin biosynthesis close to srmB. US patent 5,098,837 reports the cloning of 
five genes potentially involved in spiramycin biosynthesis. These genes were named 
srmD, srmE, srmF, srmG and srmH. 

One of the major difficulties in producing compounds such as spiramycins lies in 
15 the fact that very large fermentation volumes are necessary to produce a relatively small 
amount of product. It is therefore desirable to be able to increase the efficiency of 
production of such molecules in order to decrease the cost of production thereof. 

The biosynthetic pathway for spiramycins is a complex process and it would be 
desirable to identify and to eliminate the parasitic reactions which might exist during 
20 this process. The aim of such a manipulation is to obtain a purer antibiotic and/or an 
improvement in productivity. In this respect, Streptomyces ambofaciens naturally 
produces spiramycin I, II and III (cf. figure 1); however, the antibiotic activity of 
spiramycin I is clearly greater than that of spiramycins II and III (Liu et aL, 1999). It 
would therefore be desirable to be able to have strains which produce only spiramycin I. 

25 Due to the commercial value of macrolide antibiotics, the production of novel 

derivatives, in particular of analogs of spiramycins having advantageous properties, is 
intensely sought. It would be desirable to be able to generate, in sufficient amount, the 
biosynthetic intermediates of the biosynthetic pathway for spiramycins or spiramycin 
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derivatives, in particular for the purpose of producing spiramycin-derived hybrid 
antibiotics. 



General description of the invention 

5 The present invention results from the cloning of genes the product of which is 

involved in spiramycin biosynthesis. The invention relates, first of all, to novel genes of 
the biosynthetic pathway for spiramycins and novel polypeptides involved in this 
biosynthesis. 

The genes of the biosynthetic pathway and the associated coding sequences have 
10 been cloned and the DNA sequence of each one has been determined. The cloned coding 
sequences will hereafter be designated orfl*c, or/2*c, or/3*c, orf4*c, or/5* orf6* 
or/7*c, or/8* or/9* or/70* or/7, orf2, or/3, orf4, or/5, or/6, or/7, or/8, orJ9c, or/70, 
orfllc, or/72, orf!3c, orfl4, or/7 5c, orfl6, orfl7, orfl8 t or/79, or/20, orfilc, orf22c, 
or/23c, or/24c, or/25c, or/26, or/27, or/28c, or/29, orfSOc, or/31, or/32c, or/33 and 
15 or/34c. The function of the proteins encoded by these sequences in the biosynthetic 
pathway for spiramycins is developed in the discussion hereinafter, which is illustrated 
by figures 4, 5, 6 and 8. 

1) A first subject of the invention concerns a polynucleotide encoding a 
polypeptide involved in spiramycin biosynthesis, wherein the sequence of said 
20 polynucleotide is: 

(a) one of the sequences SEQ ID No. 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 28, 
30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 
109, 111, 113, 115, 118, 120, 141, 143, 145, 147 and 149, 

(b) one of the sequences consisting of the variants of the sequences (a), 

25 (c) one of the sequence derived from the sequences (a) and (b) due to the 

degeneracy of the genetic code. 
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2) A subject of the present invention is also a polynucleotide which hybridizes, 
under high stringency hybridization conditions, to at least one of the polynucleotides 
according to paragraph 1) above. 

3) The invention also relates to a polynucleotide exhibiting at least 70%, 80%, 
5 85%, 90%, 95% or 98% nucleotide identity with a polynucleotide comprising at least 

10, 12, 15, 18, 20 to 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400, 
450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150, 1200, 
1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 1850 or 
1900 consecutive nucleotides of a polynucleotide according to paragraph 1) above. 

10 4) The invention also relates to a polynucleotide according to paragraph 2) or 3) 

above, which is isolated from a bacterium of the genus Streptomyces. 

5) The invention also relates to a polynucleotide according to paragraph 2), 3) or 
4) above, which encodes a protein involved in the biosynthesis of a macrolide. 

6) The invention also relates to a polynucleotide according to paragraph 2), 3) or 
15 4) above, which encodes a protein having an activity similar to the protein encoded by 

the polynucleotide with which it hybridizes or with which it exhibits the identity. 

7) The invention also relates to a polypeptide resulting from the expression of a 
polynucleotide according to paragraph 1), 2), 3), 4), 5) or 6) above. 

8) Another aspect of the invention concerns a polypeptide involved in a 
20 spiramycin biosynthesis, wherein the sequence of said polypeptide is: 

(a) one of the sequences SEQ ID No. 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
27, 29, 31, 32, 33, 35, 37, 38, 39, 41, 42, 44, 46, 48, 50, 51, 52, 54, 55, 56, 57, 58, 59, 
61,63, 65, 67, 69,71,73,75, 77, 79,81,83, 85, 108, 110, 112, 114, 116, 117, 119, 121, 
142, 144, 146, 148 and 150, 
25 (b) one of the sequences as defined in (a), except that throughout said sequence, 

one or more amino acids have been substituted, inserted or deleted without affecting the 
functional properties thereof, 



(c) one of the sequences consisting of the variants of the sequences (a) and (b). 

9) Another subject of the invention concerns a polypeptide exhibiting at least 
70%, 80%, 85%, 90%, 95% or 98% amino acid identity with a polypeptide comprising 
at least 10, 15, 20, 30 to 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 
260, 280, 300, 320, 340, 360, 380, 400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 
600, 620 or 640 consecutive amino acids of a polypeptide according to paragraph 8) 
above. 

10) Another aspect of the invention concerns a polypeptide according to 
paragraph 9) above, which is isolated from a bacterium of the genus Streptomyces. 

1 1 ) Another aspect of the invention concerns a polypeptide according to 
paragraph 9) or 10) above, which encodes a protein involved in the biosynthesis of a 
macrolide. 

1 2) Another aspect of the invention concerns a polypeptide according to pargraph 
9), 10) or 1 1) above, which has an activity similar to that of the polypeptide with which 
it shares the identity. 

13) Another aspect of the invention concerns a recombinant DNA, which 
comprises at least one polynucleotide according to one of paragraphs 1), 2), 3), 4), 5) 
and 6) above. 

14) Another aspect of the invention concerns a recombinant DNA according to 
paragraph 13) above, wherein said recombinant DNA is included in a vector. 

15) Another aspect of the invention concerns a recombinant DNA according to 
paragraph 14) above, wherein said vector is chosen from bacteriophages, plasmids, 
phagemids, integrative vectors, fosmids, cosmids, shuttle vectors, BACs and PACs. 

16) Another aspect of the invention concerns a recombinant DNA according to 
paragraph 15) above, which is chosen from pOS49.1, pOS49.11, pOSC49:12, 
pOS49.14, pOS49.16, pOS49.28, pOS44.1, pOS44.2, pOS44.4, pSPMS, pSPM7, 
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pOS49.67, pOS49.88, pOS49.106, pOS49.120, pOS49.107, pOS49.32, pOS49.43, 
pOS49.44, pOS49.50, pOS49.99, pSPM17, pSPM21, pSPM502, pSPM504, pSPM507, 
pSPM508, pSPM509, pSPMl, pBXLllll, pBXL1112, pBXL1113, pSPM520, 
pSPM521, pSPM522, pSPM523, pSPM524, pSPM525, pSPM527, pSPM528, 
5 pSPM34, pSPM35, pSPM36, pSPM37, pSPM38, pSPM39, pSPM40, pSPM41, 
pSPM42, pSPM43, pSPM44, pSPM45, pSPM47, pSPM48, pSPMSO, pSPMSl, 
pSPM52 9 pSPM53, pSPM55, pSPM56, pSPM58, pSPM72, pSPM73, pSPM515 5 
PSPM519, pSPM74 5 pSPM75, pSPM79 3 pSPM83, pSPM107, pSPM543 and pSPM106. 

17) Another aspect of the invention concerns an expression vector, which 
10 comprises at least one nucleic acid sequence encoding a polypeptide according to 

paragraph 7), 8), 9), 10), 1 1) or 12) above. 

18) The invention also relates to an expression system comprising a suitable 
expression vector and a host cell which allows the expression of one or more 
polypeptides according to paragraph 7), 8), 9), 10), 1 1) or 12) above. 

15 19) The invention also relates to an expression system according to paragraph 18 

above, which is chosen from prokaryotic expression systems and eukaryotic expression 
systems. 

20) The invention also relates to an expression system according to paragraph 1 9) 
above, which is chosen from systems of expression in the bacterium E. coli, baculovirus 

20 expression systems which allow expression in insect cells, expression systems which 
allow expression in yeast cells and expression systems which allow expression in 
mammalian cells. 

21) The invention also relates to a host cell into which at least one polypeptide 
and/or at least one recombinant DNA and/or at least one expression vector according to 

25 one of paragraphs 1), 2), 3), 4), 5), 6), 13), 14), 15), 16) and 17) above has been 
introduced. 
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22) The invention also relates to a method of producing a polypeptide according 
to paragraph 7), 8), 9), 10), 11) or 12) above, wherein said method comprises the 
following steps: 

a) inserting at least one nucleic acid encoding said polypeptide into a suitable 

5 vector; 

b) culturing, in a suitable culture medium, a host cell transformed or transfected 
beforehand with the vector of step a); 

c) recovering the conditioned culture medium or a cell extract; 

d) separating and purifying said polypeptide from said culture medium or else 
10 from the cell extract obtained in step c); 

e) where appropriate, characterizing the recombinant polypeptide produced. 

23) Another aspect of the invention concerns a microorganism blocked in a step 
of the biosynthetic pathway for at least one macrolide. 

24) Another aspect of the invention concerns a microorganism according to 
15 paragraph 23) above, which is obtained by inactivating the function of at least one 

protein involved in the biosynthesis of this (these) macrolide(s) in a microorganism 
which produces this (these) macroJide(s). 

25) Another aspect of the invention concerns a microorganism according to 
paragraph 24) above, wherein the inactivation of this (these) protein(s) is carried out by 

20 mutagenesis in the gene(s) encoding said protein(s) or by expression of one or more 
antisense RNAs complementary to the messenger RNA(s) encoding said protein(s). 

26) Another aspect of the invention concerns a microorganism according to 
paragraph 25) above, wherein the inactivation of this (these) protein(s) is carried out by 
mutagenesis via irradiation, by action of a mutagenic chemical agent, by site-directed 

25 mutagenesis or by gene replacement. 

27) Another aspect of the invention concerns a microorganism according to 
paragraph 25) or 26) above, wherein the mutagenesis or mutageneses is or are carried 
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out in vitro or in situ, by suppression, substitution, deletion and/or addition of one or 
more bases in the gene(s) under consideration or by gene inactivation. 

28) Another aspect of the invention concerns a microorganism according to 
paragraph 23), 24), 25), 26) or 27) above, wherein said microorganism is a bacterium of 

5 the genus Streptomyces. 

29) Another aspect of the invention concerns a microorganism according to 
paragraph 23), 24), 25), 26), 27) or 28) above, wherein the macrolide is spiramycin. 

30) Another aspect of the invention concerns a microorganism according to 
paragraph 23), 24), 25), 26), 27), 28) or 29) above, wherein said microorganism is a 

10 strain of S. ambofaciens. 

31) Another aspect of the invention concerns a microorganism according to 
paragraph 23), 24), 25), 26), 27), 28), 29) or 30) above, wherein the mutagenesis is 
carried out in at least one gene comprising a sequence according to one of paragraphs 
1), 2), 3), 4), 5) and 6) above. 

15 32) Another aspect of the invention concerns a microorganism according to 

paragraph 25), 26), 27), 28), 29), 30) or 31) above, wherein the mutagenesis is carried 
out in one or more genes comprising one of the sequences corresponding to one or more 
of the sequences SEQIDNo. 3, 5, 7, 9, 1 1, 13, 15, 17, 19,21,23,25,28, 30, 34,36, 40, 
43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 1 1 1, 1 13, 

20 115, 118, 120, 141, 143, 145, 147 and 149. 

33) Another aspect of the invention concerns a microorganism according to 
paragraph 25), 26), 27), 28), 29), 30), 31) or 32) above, wherein the mutagenesis 
consists of the gene inactivation of a gene comprising a sequence corresponding to the 
sequence SEQ ID No. 13. 

25 34) Another aspect of the invention concerns a strain of Streptomyces 

ambofaciens, which is a strain chosen from one of the strains deposited with the 
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Collection Nationale de Cultures de Microorganismes [National Collection of Cultures 
and Microorganisms] (CNCM) on July 10, 2002, under the registration number 1-2909, 
1-291 1, 1-2912, 1-2913, 1-2914, 1-2915, 1-2916 or 1-2917. 

35) Another aspect of the invention concerns a method of preparing a macrolide 
5 biosynthesis intermediate, which comprises the following steps: 

a) culturing, in a suitable culture medium, a microorganism according to one of 
paragraphs 23), 24), 25), 26), 27), 28), 29), 30), 31), 32), 33) or 34) above, 

b) recovering the conditioned culture medium or a cell extract, 

c) separating and purifying said biosynthesis intermediate from said culture 
10 medium or else from the extract obtained in step b). 

36) Another aspect of the invention concerns a method of preparing a molecule 
derived from a macrolide, wherein a biosynthesis intermediate is prepared according to 
the method of paragraph 35) above and the intermediate thus produced is modified. 

37) Another aspect of the invention concerns a method of preparation according 
15 to paragraph 36) above, wherein said intermediate is modified chemically, 

biochemically, enzymatically and/or microbiologically. 

38) Another aspect of the invention concerns a method of preparation according 
to paragraph 36) or 37) above, wherein one or more genes encoding proteins capable of 
modifying the intermediate by using it as substrate is (are) introduced into said 

20 mi croorgani sm . 

39) Another aspect of the invention concerns a method of preparation according 
to paragraph 36), 37) or 38) above, wherein the macrolide is spiramycin. 

40) Another aspect of the invention concerns a method of preparation according 
to paragraph 36), 37), 38) or 39) above, wherein the microorganism used is a strain of 

25 5. ambofaciens. 
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41) Another aspect of the invention concerns a microorganism which produces 
spiramycin I but which does not produce spiramycin II and III 

42) Another aspect of the invention concerns a microorganism according to 
paragraph 41) above, which comprises all of the genes required for the biosynthesis of 
spiramycin I, but wherein the gene comprising the sequence SEQ ID No. 13 or one of 
its variants, or one of the sequences derived therefrom due to the degeneracy of the 
genetic code, and encoding a polypeptide of sequence SEQ ID No. 14 or one of its 
variants, is not expressed or has been made inactive. 

43) Another aspect of the invention concerns a microorganism according to 
paragraph 42) above, wherein said inactivation is carried out by mutagenesis in the gene 
encoding said protein or by the expression of an antisense RNA complementary to the 
messenger RNA encoding said protein. 

44) Another aspect of the invention concerns a microorganism according to 
paragraph 43) above, wherein said mutagenesis is carried out in the promoter of this 
gene, in the coding sequence or in a noncoding sequence so as to make the encoded 
protein inactive or to prevent its expression or its translation therefrom. 

45) Another aspect of the invention concerns a microorganism according to 
paragraph 43) or 44) above, wherein the mutagenesis is carried out by irradiation, by 
action of a mutagenic chemical agent, by site-directed mutagenesis or by gene 
replacement. 

46) Another aspect of the invention concerns a microorganism according to 
paragraph 43), 44) or 45) above, wherein the mutagenesis is carried out in vitro or in 
situ, by suppression, substitution, deletion and/or addition of one or more bases in the 
gene under consideration or by gene inactivation. 

47) Another aspect of the invention concerns a microorganism according to 
paragraph 41) or 42) above, wherein said microorganism is obtained by expressing the 
genes of the biosynthetic pathway for spiramycin without these genes comprising the 
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gene comprising the sequence corresponding to SEQ ID No. 13 or one of its variants, or 
one of the sequences derived therefrom due to the degeneracy of the genetic code, and 
encoding a polypeptide of sequence SEQ ID No. 14 or one of its variants. 

48) Another aspect of the invention concerns a microorganism according to 
paragraph 41), 42), 43), 44), 45), 46) or 47) above, wherein said microorganism is a 
bacterium of the genus Streptomyces. 

49) Another aspect of the invention concerns a microorganism according to 
paragraph 41), 42), 43), 44), 45), 46), 47) or 48) above, wherein said microorganism is 
obtained from a starting strain which produces spiramycins I, II and III. 

50) Another aspect of the invention concerns a microorganism according to 
paragraph 41), 42), 43), 44), 45), 46), 47), 48) or 49) above, which is obtained by 
mutagenesis in a gene comprising the sequence corresponding to SEQ ID No. 13 or one 
of its variants, or one of the sequences derived therefrom due to the degeneracy of the 
genetic code, and encoding a polypeptide of sequence SEQ ID No. 14 or one of its 
variants having the same function. 

51) Another aspect of the invention concerns a microorganism according to 
paragraph 41), 42), 43), 44), 45), 46), 47), 48), 49) or 50) above, wherein said 
microorganism is obtained from a strain of S. ambofaciens which produces spiramycins 
I, II and III, in which gene inactivation of the gene comprising the sequences 
corresponding to SEQ ID No. 1 3 or one of the sequences derived therefrom due to the 
degeneracy of the genetic code is carried out. 

52) Another aspect of the invention concerns a strain of S. ambofaciens, which is 
the strain deposited with the Collection Nationale de Cultures de Microorganismes 
(CNCM) on July 10, 2002, under the registration number 1-291 0. 

53) Another aspect of the invention concerns a method of producing spiramycin I, 
which comprises the following steps: 
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(a) culturing, in a suitable culture medium, a microorganism according to one of 
paragraphs 41), 42), 43), 44), 45), 46), 47), 48), 49), 50), 51) and 52) above, 

(b) recovering the conditioned culture medium or a cell extract, 

(c) separating and purifying the spiramycin I from said culture medium or else 
from the cell extract obtained in step b). 

54) Another aspect of the invention concerns the use of a polynucleotide 
according to one of paragraphs 1), 2), 3), 4), 5) and 6) above, for improving the 
macrolide production of a microorganism. 

55) Another aspect of the invention concerns a macrolide-producing mutant 
microorganism, which has a genetic modification in at least one gene comprising a 
sequence according to one of paragraphs 1), 2), 3), 4), 5) or 6) above and which 
overexpresses at least one gene comprising a sequence according to one of paragraphs 
1), 2), 3), 4), 5) and 6) above. 

56) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55) above, wherein the genetic modification consists of a suppression, a 
substitution, a deletion and/or an addition of one or more bases in the gene(s) under 
consideration, with the aim of expressing one or more proteins having greater activity 
or of expressing a higher level of this (these) protein(s). 

57) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55) above, wherein the overexpression of the gene under consideration is 
obtained by increasing the copy number for the gene and/or introducing a promoter 
which is more active than the wild-type promoter. 

58) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55) or 57) above, wherein the overexpression of the gene under 
consideration is obtained by transforming a macrolide-producing microorganism with a 
recombinant DNA construct according to paragraph 13, 14 or 17 above, allowing the 
overexpression of this gene. 
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59) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55), 56), 57) or 58) above, wherein the genetic modification is carried out 
in one or more genes comprising one of the sequences corresponding to one or more of 
the sequences SEQ ID No. 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 28, 30, 34, 36, 40, 

5 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 111,113, 
115, 118, 120, 141, 143, 145, 147 and 149, or one of its variants, or one of the 
sequences derived therefrom due to the degeneracy of the genetic code. 

60) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55), 56), 57), 58) or 59) above, wherein the microorganism overexpresses 

10 one or more genes comprising one of the sequences corresponding to one or more of the 
sequences SEQ ID No. 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 28, 30, 34, 36, 40, 43, 
45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 1 1 1, 1 13, 1 15, 
118, 120, 141, 143, 145, 147 and 149, or one of its variants, or one of the sequences 
derived therefrom due to the degeneracy of the genetic code. 

15 61) Another aspect of the invention concerns a mutant microorganism according 

to paragraph 55), 56), 57), 58), 59) or 60) above, wherein said microorganism is a 
bacterium of the genus Streptomyces. 

62) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55), 56), 57), 58), 59), 60) or 61) above, wherein the macrolide is 

20 spiramycin. 

63) Another aspect of the invention concerns a mutant microorganism according 
to paragraph 55), 56), 57), 58), 59), 60), 61) or 62) above, wherein said microorganism 
is a strain of S. ambofaciens. 

64) Another aspect of the invention concerns a method of producing macrolides, 
25 which comprises the following steps: 

(a) culturing, in a suitable culture medium, a microorganism according to one of 
paragraphs 55), 56), 57), 58), 59), 60), 61), 62), 63) and 64) above, 
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(b) recovering the conditioned culture medium or a cell extract, 

(c) separating and purifying said macrolide(s) produced from said culture 
medium or else from the cell extract obtained in step b). 

65) Another aspect of the invention concerns the use of a sequence and/or of a 
recombinant DNA and/or of a vector according to one of paragraphs 1), 2), 3), 4), 5), 6), 
7), 8), 9), 10), 11), 12), 13), 14), 15), 16) and 17) above, for preparing hybrid 
antibiotics. 

66) Another aspect of the invention concerns the use of at least one 
polynucleotide and/or at least one recombinant DNA and/or at least one expression 
vector and/or at least one polypeptide and/or at least one host cell according to one of 
paragraphs 1), 2), 3), 4), 5), 6), 7), 8), 9), 10), 11), 12), 13), 14), 15), 16), 17) and 21) 
above, for carrying out one or more bioconversions. 

67) Another aspect of the invention concerns a polynucleotide which is a 
polynucleotide complementary to one of the polynucleotides according to paragraph 1), 
2), 3), 4), 5) or 6) above. 

68) Another aspect of the 'invention concerns a microorganism producing at 
least one spiramycin, which overexpresses: 

- a gene which can be obtained by polymerase chain reaction (PCR) using the 
following pair of sequence primers: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 

5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) 

and, as matrix, the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, 

- or a gene derived therefrom due to the degeneracy of the genetic code. 

69) Another aspect of the invention concerns a microorganism according to 
paragraph 68 or 90, which is a bacterium of the genus Streptomyces. 
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70) Another aspect of the invention concerns a microorganism according to 
paragraph 68, 69 or 90, which is a bacterium of the species Streptomyces 
ambofaciens. 

71) Another aspect of the invention concerns a microorganism according to 
5 paragraph 68, 69, 70 or 90, wherein the overexpression of said gene is obtained by 

transformation of said microorganism with an expression vector. 

72) Another aspect of the invention concerns a strain of Streptomyces 
ambofaciens, which is the strain OSC2/pSPM75(l) or the strain OSC2/pSPM75(2) 
deposited with the Collection Nationale de Cultures de Microorganismes (CNCM) 

10 [National Collection of Cultures and Microorganisms] Pasteur Institute, 25, rue du 
Docteur Roux 75724 Paris Cedex 15, France, on October 6, 2003, under the 
registration number 1-3101. 

73) Another aspect of the invention concerns a recombinant DNA which 
comprises: 

15 - a polynucleotide which can be obtained by polymerase chain reaction using 

the following pair of sequence primers: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 
5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) and, as 
matrix, the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, 
20 - or a fragment of at least 10, 12, 15, 18, 20 to 25, 30, 40, 50, 60, 70, 80, 90, 

100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 
950, 1000, 1050, 1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1460, 1470, 
1480, 1490 or 1500 consecutive nucleotides of this polynucleotide. 

74) Another aspect of the invention concerns a recombinant DNA according 
25 to paragraph 73 or 91 , which is a vector. 

75) Another aspect of the invention concerns a recombinant DNA according 
to paragraph 73, 74 or 91, which is an expression vector. 



17 



76) Another aspect of the invention concerns a host cell into which at least 
one recombinant DNA according to one of paragraphs 73, 74, 75 and 91 has been 
introduced. 

77) Another aspect of the invention concerns a method of producing a 
polypeptide, wherein said method comprises the following steps: 

a) transforming a host cell with at least one expression vector according to 
paragraph 75; 

b) cultivating, in a suitable culture medium, said host cell; 

c) recovering the conditioned culture medium or a cell extract; 

d) separating and purifying said polypeptide from said culture medium or else 
from the cell extract obtained in step c); 

e) where appropriate, characterizing the recombinant polypeptide produced. 

78) Another aspect of the invention concerns a microorganism according to 
paragraph 51, wherein the gene inactivation is carried out by in-phase deletion of the 
gene or of a part of the gene comprising the sequence corresponding to SEQ ID 
No. 13 or one of the sequences derived therefrom due to the degeneracy of the 
genetic code. 

79) Another aspect of the invention concerns a microorganism according to 
one of paragraphs 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51 and 78, which also 
overexpresses: 

- a gene which can be obtained by polymerase chain reaction using the following pair 
of sequence primers: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 

5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) and, as 

matrix, the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, 

- or a gene derived therefrom due to the degeneracy of the genetic code. 

80) Another aspect of the invention concerns an expression vector, wherein 
the polynucleotide of sequence SEQ ID No. 47, or a polynucleotide derived 
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therefrom due to the degeneracy of the genetic code, is placed under the control of a 
promoter which allows the expression of the protein encoded by said polynucleotide 
in Streptomyces ambofaciens, 

81) Another aspect of the invention concerns an expression vector according 
to paragraph 80, which is the plasmid pSPM524 or pSPM525. 

82) Another aspect of the invention concerns a strain of Streptomyces 
ambofaciens transformed with a vector according to paragraph 80 or 81. 

83) Another aspect of the invention concerns a microorganism according to 
one of paragraphs 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 78, 79 and 92, which also 
overexpresses the gene having coding sequence SEQ ID No. 47 or a coding sequence 
derived therefrom due to the degeneracy of the genetic code. 

84) Another aspect of the invention concerns a microorganism according to 
paragraph 83, which is the strain SPM502 pSPM525 deposited with the Collection 
Nationale de Cultures de Microorganismes (CNCM) Pasteur Institute, 25 rue du 
Docteur Roux 75724 Paris Cedex 15, France, on February 26, 2003, under the 
registration number 1-2977. 

85) Another aspect of the invention concerns a method of producing 
spiramycin(s), which comprises the following steps: 

(a) culturing, in a suitable culture medium, a microorganism according to one of 
paragraphs 68, 69, 70, 71, 72, 78, 79, 82, 83, 84, 90 and 92, 

(b) recovering the conditioned culture medium or a cell extract, 

(c) separating and purifying the spiramycins from said culture medium or else 
from the cell extract obtained in step b). 

86) Another aspect of the invention concerns a polypeptide, the sequence of 
which comprises the sequence SEQ ID No. 1 12 or the sequence SEQ ID No. 142. 
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87) Another aspect of the invention concerns a polypeptide, the sequence of 
which corresponds to the sequence translated from the coding sequence: 

-of a gene which can be obtained by polymerase chain reaction (PCR) using 
the following pair of sequence primers: 
5 5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 

5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) and, as 
matrix, the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, 

-or of a gene derived therefrom due to the degeneracy of the genetic code. 

88) Another aspect of the invention concerns an expression vector which allows 
10 the expression of a polypeptide according to paragraph 86, 87 or 93 in Streptomyces 

ambofaciens. 

89) Another aspect of the invention concerns an expression vector according to 
paragraph 88, which is the plasmid pSPM75. 

90) Another aspect of the invention concerns a microorganism according to 
15 paragraph 68, wherein the gene which can be obtained by polymerase chain 

amplification is the gene of coding sequence SEQ ID No. 141, or a gene derived 
therefrom due to the degeneracy of the genetic code. 

91) Another aspect of the invention concerns a recombinant DNA according 
to paragraph 73, wherein the polynucleotide which can be obtained by polymerase 

20 chain amplification is a polynucleotide of sequence SEQ ID No. 141 . 

92) Another aspect of the invention concerns a microorganism according to 
paragraph 79, wherein the gene which can be obtained by polymerase chain 
amplification is the gene of coding sequence SEQ ID No. 141, or a gene derived 
therefrom due to the degeneracy of the genetic code. 
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93) Another aspect of the invention concerns a polypeptide, the sequence of 
which is SEQ ID No. 142. 



20 



General definitions 

For the purpose of the present invention, the term "isolated" denotes a biological 
material (nucleic acid or protein) which has been removed from its original environment 
(the environment in which it is naturally located). 

For example, a polynucleotide present in the natural state in a plant or an animal is 
not isolated. The same polynucleotide separated from the adjacent nucleic acids within 
which it is naturally inserted in the genome of the plant or the animal is considered to be 
"isolated". 

Such a polynucleotide can be included in a vector and/or such a polynucleotide can 
be included in a composition, and can nevertheless remain in the isolated state due to the 
fact that the vector or the composition does not constitute its natural environment. 

The term "purified" does not require the material to be present in a form of 
absolute purity, excluding the presence of other compounds. It is rather a relative 
definition. 

A polynucleotide in the "purified" state after purification of the starting material or 
of the natural material by at least an order of magnitude, preferably 2 or 3, and 
preferentially 4 or 5, orders of magnitude. 

For the purposes of the present invention, the term ORF ("Open Reading Frame") 
has been used to denote in particular the coding sequence of a gene. 

For the purposes of the present invention, the expression "nucleotide sequence" 
can be used to denote equally a polynucleotide or a nucleic acid. The expression 
"nucleotide sequence" encompasses the genetic material itself and is not therefore 
restricted to the information concerning its sequence. 
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The terms "nucleic acid", "polynucleotide", "oligonucleotide" or alternatively 
"nucleotide sequence" encompass RNA, DNA or cDNA sequences or RN A/DNA hybrid 
sequences of more than one nucleotide, equally in the single-chain form or in the form 
of a duplex. 

The term "nucleotide" denotes both the natural nucleotides (A, T, G, C) and also 
modified nucleotides which comprise at least one modification, such as (1) a purine 
analog, (2) a pyrimidine analog, or (3) a sugar analog, examples of such modified 
nucleotides being described, for example, in PCT application No. WO 95/04064. 

For the purposes of the present invention, a first polynucleotide is considered to be 
"complementary" to a second polynucleotide when each base of the first polynucleotide 
is paired to the complementary base of the second polynucleotide, the orientation of 
which is reversed. The complementary bases are A and T (or A and U), or C and G. 

The term "genes of the biosynthetic pathway for spiramycins" also comprises the 
regulatory genes and the genes imparting resistance to the producer microorganisms. 

According to the invention, the term "fragment" of a reference nucleic acid will be 
intended to mean a nucleotide sequence which is shorter compared to the reference 
nucleic acid and comprises, on the common portion, a nucleotide sequence identical to 
the reference nucleic acid. 

According to the invention, such a nucleic acid "fragment" may, where 
appropriate, be included in a larger polynucleotide of which it is a constituent. 

Such fragments comprise, or alternatively consist of, polynucleotides of length 
ranging from 8, 10, 12, 15, 18, 20 to 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 
300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1 100, 
1150, 1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 
1 850 or 1900 consecutive nucleotides of a nucleic acid according to the invention. 
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According to the invention, the term "fragment" of a polypeptide will be intended 
to mean a polypeptide the amino acid sequence of which is shorter than that of the 
reference polypeptide and which comprises, over the entire portion common with these 
reference polypeptides, an identical amino acid sequence. 

Such fragments may, where appropriate, be included in a larger polypeptide of 
which they are part. 

Such fragments of a polypeptide according to the invention may be 10, 15, 20, 30 
to 40, 50, 60, 70, 80, 90,100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 
360, 380, 400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 600, 620 or 640 amino acids 
in length. 

For the purpose of the present invention, the expression "high stringency 
hybridization conditions" will be intended to mean hybridization conditions which are 
unfavorable to the hybridization of nonhomologous nucleic acid strands. High 
stringency hybridization conditions can, for example, be described as conditions of 
hybridization in the buffer described by Church & Gilbert (Church & Gilbert, 1984) at a 
temperature of between 55°C and 65°C; preferably the hybridization temperature is 
55°C, even more preferably the hybridization temperature is 60°C, and most preferably 
the hybridization temperature is 65°C, followed by one or more washes carried out in 
2X SSC buffer (IX SSC buffer corresponds to an aqueous solution of 0.15M NaCl, 
15 mM of sodium citrate) at a temperature of between 55°C and 65°C; preferably this 
temperature is 55°C, even more preferably this temperature is 60°C, and most preferably 
this temperature is 65°C, followed by one or more washes in 0.5X SSC buffer at a 
temperature of between 55°C and 65°C; preferably this temperature is 55°C, even more 
preferably this temperature is 60°C, and most preferably this temperature is 65°C. 

It goes without saying that the hybridization conditions described above can be 
adjusted as a function of the length of the nucleic acid the hybridization of which is 
sought, or of the type of labeling chosen, according to techniques known to those skilled 
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in the art. The suitable hybridization conditions can, for example, be adjusted according 
to the work by F. Ausubel et al (2002). 

The term Variant" of a nucleic acid according to the invention will be intended to 
mean a nucleic acid which differs by one or more bases compared to the reference 
polynucleotide. A variant nucleic acid may be of natural origin, such as a naturally found 
allelic variant, or may also be a non-natural variant obtained, for example, by 
mutagenesis techniques. 

In general, the differences between the reference nucleic acid and the variant 
nucleic acid are small such that the nucleotide sequences of the reference nucleic acid 
and of the variant nucleic acid are very close and, in many regions, identical. The 
nucleotide modifications present in a variant nucleic acid may be silent, which means 
that they do not modify the amino acid sequences encoded by said variant nucleic acid. 

However, the nucleotide changes in a variant nucleic acid may also result in 
substitutions, additions and/or deletions in the polypeptide encoded by the variant 
nucleic acid compared to the peptides encoded by the reference nucleic acid. In addition, 
nucleotide modifications in the coding regions may produce conservative or 
nonconservative substitutions in the amino acid sequence. 

Preferably, the variant nucleic acids according to the invention encode 
polypeptides which conserve substantially the same function or biological activity as the 
polypeptide of the reference nucleic acid, or else the ability to be recognized by 
antibodies against the polypeptides encoded by the initial nucleic acid. 

Some variant nucleic acids will thus encode mutated forms of the polypeptides, the 
systematic study of which will make it possible to deduce structure-activity relationships 
of the proteins in question. 

The term 'Variant" of a polypeptide according to the invention will mainly be 
intended to mean a polypeptide the amino acid sequence of which contains one or more 
substitutions, additions or deletions of at least one amino acid residue, compared to the 
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amino acid sequence of the reference polypeptide, it being understood that the amino 
acid substitutions can be indifferently conservative or nonconservative. 

Preferably, the variant polypeptides according to the invention conserve 
substantially the same function or biological activity as the reference polypeptide, or else 
5 the ability to be recognized by antibodies against the initial polypeptide. 

For the purpose of the invention, polypeptide having "an activity similar" to a 
reference polypeptide is intended to mean a polypeptide having a biological activity 
close, but not necessarily identical, to that of the reference polypeptide as measured in a 
biological assay suitable for measuring the biological activity of the reference 
10 polypeptide. 

For the purpose of the invention, the term "hybrid antibiotic" is intended to mean a 
compound generated by constructing an artificial biosynthetic pathway using 
recombinant DNA technology. 
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Detailed description of the invention 

A subject of the present invention is more particularly novel genes of the 
biosynthetic pathway for spiramycins and novel polypeptides involved in this 
biosynthesis as presented in the detailed description below. 

The genes of the biosynthetic pathway were cloned and the DNA sequence of 
these genes was determined. The sequences obtained were analyzed using the FramePlot 
program (J. Ishikawa & K. Hotta, 1999). Among the open reading frames, those 
exhibiting a codon usage typical of Streptomyces were identified. This analysis showed 
that this region comprises 44 ORFs located on either side of five genes encoding the 
enzyme "polyketide synthase" (PKS), and exhibiting a codon usage typical of 
Streptomyces. On either side of these five genes encoding PKSs, 10 and 34 ORFs, 
respectively, were identified downstream and upstream (downstream and upstream 
being defined by the orientation of the 5 PKS genes all oriented in the same direction) 
(cf. figures 3 and 37). Thus, the 34 open reading frames of this type, occupying a region 
of approximately 41 .7 kb (cf. SEQ ID No. 1 exhibiting a first region of 31 kb containing 
25 ORFs and SEQ ID No. 140 exhibiting a region of approximately 12.1 kb, 1.4 kb of 
which overlap the preceding sequence (SEQ ID No. 1) and approximately 10.7 kb of 
which correspond to the subsequent sequence, the latter portion of approximately 
10.7 kb containing 9 additional ORFs (including one ORF of partial sequence) cf. also 
figures 3 and 37 below), were identified upstream of the 5 genes encoding the PKSs, and 
10 occupying a region approximately 11.1 kb (SEQ ID No. 2 and figure 3) were 
identified downstream of the PKS genes. The 10 genes located downstream of the 5 
PKS genes were thus named orfl*c, orf2*c t orf3*c, orf4*c, orf5* orf6* orf7*c, or/8* 
or/9* and orflO* (SEQ ID No. 3, 5, 7, 9, 1 1, 13, 15, 17, 19 and 21). The "c" added in the 
name of the gene signifies, for the ORF in question, that the coding sequence is in the 
reverse orientation (the coding strand is therefore the strand complementary to the 
sequence given in SEQ ID No. 2 for these genes). Using the same nomenclature, the 34 
ORFs upstream of the PKS genes were named orfl, orfl, or/3, orf4 t orf5, orf6 t orfl, 
or/8, orf9c, orflO, orfllc, orfl 2, orfl 3c, orfl 4, orfl 5c, orfl 6, orfl 7, orfl 8, orfl 9, orf20, 
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or/21c, orf22c, or/23c, orf24c, orf25c, orf26, orf27, orf28c, orf29 t orf30c t orfil, orj32c, 
or/33 and orf34c (SEQ ID Nos 23, 25, 28, 30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 111, 113, 115, 118, 120, 141, 143, 145, 
147 and 149) (cf. figures 3 and 37). 

5 The protein sequences deduced from these open reading frames were compared 

with those present in various databases using various programs: BLAST (Altschul et aL, 
1990) (Altschul et aL, 1997), CD-search, COGs (Cluster of Orthologous Groups) (these 
three programs are accessible in particular from the National Center for Biotechnology 
Information (NCBI) (Bethesda, Maryland, USA)), FASTA ((W.R. Pearson & D.J. 
10 Lipman, 1988) and (W.R. Pearson, 1990), BEAUTY (K.C. Worley et al., 1995)), (these 
two programs are accessible in particular from the INFOBIOGEN resource center, Evry, 
France). These comparisons made it possible to formulate hypotheses regarding the 
function of the products of these genes and to identify those liable to be involved in 
spiramycin biosynthesis. 

15 Genes located downstream of the genes encoding the PKSs 

A diagrammatic representation of the organization of the region is given in 
figure 3. As will be demonstrated; below, on the 10 genes identified downstream of the 
genes encoding the PKSs, 9 appear to be involved in the biosynthesis of or the resistance 
to spiramycins. They are the following 9 genes: orfl*c, or/2*c t orJ3*c, orf4*c, orf5* 
20 orf6* orp*c t or/8* and orj9* 



In table 1 below the references to the DNA sequence and the amino acid sequence 
of the 10 genes identified downstream of the 5 PKS genes are given. 

Table 1 



Gene 


Position in the sequence 
SEQ ID No. 2 


DNA sequence 


Polypeptide sequences 


orfl*c 


10882 to 10172 


SEQ ID No. 3 


SEQ ID No. 4 


orf2*c 


10052 to 8781 j 


SEQ ID No. 5 


SEQ ID No. 6 
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or/3*c 


8741 to 7476 


SEQ ID No. 7 


SEQ ID No. 8 


orf4*c 


7459 to 6100 


SEQ ID No. 9 


SEQ ID No. 10 


orfi* 


5302 to 5976 


SEQ ID No. 1 1 


SEQ ID No. 12 


or/6* 


4061 to 5305 


SEQ ID No. 13 


SEQ ID No. 14 


orJ7*c 


3665 to 2817 


SEQ ID No. 15 


SEQ ID No. 16 


or/8* 


1925 to 2755 


SEQ ID No. 17 


SEQ ID No. 18 


or/9* 


1007 to 1888 


SEQ ID No. 19 


SEQ ID No. 20 


or/10* 


710 to 937 


SEQ ID No. 21 


SEQ ID No. 22 



The "c" added to the name of the gene indicates that the coding sequence is in the 
reverse orientation (the coding strand is therefore the strand complementary to the 
sequence given in SEQ ID No. 2 for these genes). 



With the aim of determining the function of the polypeptides identified, three 
5 types of experiment were carried out: comparison of the identified sequences with 
sequences of known functions, gene inactivation experiments, leading to the 
construction of mutant strains, and analyses of the production of spiramycins and of 
spiramycin biosynthesis intermediates by these mutant strains. 

The protein sequences deduced from these open reading frames were first of all 
10 compared with those present in various databases using various programs: BLAST 
(Altschul et al, 1990) (Altschul et aL 9 1997), CD-search, COGs (Cluster of Orthologous 
Groups), FASTA ((W.R. Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 1990), 
BEAUTY (K.C. Worley et al % 1995)). These comparisons made it possible to formulate 
hypotheses regarding the function of the products of these genes and to identify those 
15 liable to be involved in spiramycin biosynthesis. Table 2 shows the proteins exhibiting 
strong similarity with the products of the 10 genes located downstream of the 5 PKS 
genes. 
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Table 2 



Gene 
product 


Protein exhibiting 
significant similarity 


GenBank 
accession number 


BLAST* 
score 


Reported function 


orfl*c 


TylMI(orf3*) (S.fradiae) 


CAA57473 


287 


N- 

methyltransferase 


or/2*c 


dnrQ gene product 
(Streptomyces peucetius) 


AAD 15266 


153 


unknown 


orf3*c 


TylMII(orf2*) {S.fradiae) 


CAA57472 


448 


Glvcosvltransferase 


orf4*c 


Crotonyl-CoA reductase 
(S. coelicolor) 


NP_630556 


772 


Crotonyl-CoA 
reductase 


or/5* 


MdmC 

(S. mycarofaciens) 


B42719 


355 


O- 

methyltransferase 


orf6* 


3-O-acyltransferase 
(S. mycarofaciens) 


Q00718 


494 


Acyltransferase 


orf7*c 


MdmA 

(S. mycarofaciens) 


A60725 


380 


Protein involved in 

midecamycin 

resistance 


or/8* 


ABC-transporter 
(S. griseus) 


CAC22119 


191 


ABC-transporter 


or/9* 


ABC-transporter 
(S. griseus) 


CAC22118 


269 


ABC-transporter 


or/70* 


Putative small conserved 
hypothetical protein 
(S. coelicolor) 


NP_627432 


109 


unknown 



* a greater sequence similarity is associated with a higher BLAST score (Altschul 
etai, 1990). 



Gene inactivation experiments were carried out in order to confirm these results. The 
5 methods used consist in performing a gene replacement. The target gene to be 
interrupted is replaced with a copy of this gene interrupted with a cassette imparting 
resistance to an antibiotic (for example apramycin, geneticin or hygromycin). The 
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cassettes used are bordered on either side by translation termination codons in all the 
reading frames and by transcription terminators which are active in Streptomyces. 
Insertion of the cassette into the target gene may or may not be accompanied by a 
deletion in this target gene. The size of the regions flanking the cassette may range from 
5 a few hundred to several thousand base pairs. A second type of cassette can be used for 
the gene inactivation: cassettes termed "excisable cassettes". These cassettes have the 
advantage of being able to be excised in Streptomyces by a site-specific recombination 
event after having been introduced into the genome of S. ambofaciens. The aim is to 
inactivate certain genes in strains of Streptomyces without leaving, in the final strain, 

10 selection markers or large DNA sequences not belonging to the strain. After excision, 
only a short sequence of about 30 base pairs (called "cicatricial" site) remains in the 
genome of the strain (cf. figure 10). The use of this system consists, initially, in 
replacing the wild-type copy of the target gene (by virtue of two homologous 
recombination events, cf. figure 9) with a construct in which an excisable cassette has 

15 been inserted into this target gene. The insertion of this cassette is accompanied by a 
deletion in the target gene (cf. figure 9). Secondly, the excision of the excisable cassette 
from the genome of the strain is brought about. The excisable cassette functions by 
virtue of a system of site-specific recombination and has the advantage of making it 
possible to obtain Streptomyces mutants which do not in the end carry a resistance gene. 

20 Possible polar effects on the expression of the genes located downstream of the 
inactivated gene(s) are also avoided (cf. figure 10). The strains thus constructed were 
tested for their spiramycin production. 

The orfl*c, orf2*c, orft*c and orf4*c genes were not inactivated since the 
sequence comparison experiments made it possible to determine that these genes had a 

25 relatively high similarity with genes involved in the biosynthesis of a relatively close 
antibiotic. Thus, the orfl*c gene encodes a protein exhibiting 66% identity (determined 
using the BLAST program) with the protein encoded by the tylMl gene which encodes 
an N-methyltransferase which is involved in the biosynthesis of tylosine and which 
catalyzes 3-N-methylation during the production of mycaminose in Streptomyces fradiae 

30 (A.R. Gandecha et aL, 1997; GenBank accession number: CAA57473; BLAST score: 
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287). This similarity with a protein involved in the biosynthetic pathway for another 
antibiotic relatively close, and more particularly in the biosynthesis of mycaminose, 
suggests that the orfl*c gene encodes an N-methyltransferase responsible for an 
N-methylation during the biosynthesis of forosamine or of mycaminose (cf figures 5 
5 and 6). This hypothesis is supported by the fact that the protein encoded by the orfl *c 
gene exhibits strong similarity with other proteins of similar function in other organisms 
(cf. table 3). 



Table 3 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


methyltransferase (5. antibioticus) 


CAA05643 


277 


methyltransferase 


N,N-dimethyltransferase (5. venezuelae) 


AAC68678 


268 


N,N- 

dimethyltransferase 


probable N-methylase snogX 
(S. nogalater) 


T46679 


243 


N-methyltransferase 



* a greater sequence similarity is associated with a higher BLAST score (Altschul 



10 et al, 1990). 

The orf2*c gene encodes a protein exhibiting relatively strong similarity (35% 
identity) with a protein encoded by the tylMIII gene encoding an NDP hexose 
3,4-isomerase involved in the biosynthesis of tylosine in Streptomyces fradiae 
(A.R. Gandecha et aL, 1997; GenBank accession number: CAA57471; BLAST score: 
15 130). This similarity with a protein involved in the biosynthetic pathway of another 
close antibiotic, and more particularly in the biosynthesis of mycaminose, strongly 
suggests that the o//*2c gene encodes an NDP hexose 3,4-isomerase responsible for 
isomerization during the biosynthesis of one of the sugars of spiramycin, possibly 
mycaminose (cf. figures 5 and 6). 
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The or/3*c gene encodes a protein exhibiting relatively strong similarity (59% 
identity) with a protein encoded by the tylMII gene encoding a glycosyltransferase 
involved in tylosine biosynthesis in Streptomyces fradiae (A.R. Gandecha et aL, 1997; 
GenBank accession number: CAA57472; BLAST score: 448). This similarity with a 
5 protein involved in the biosynthetic pathway for another close antibiotic strongly 
suggests that the orf*3c gene encodes a glycosyltransferase. This hypothesis is supported 
by the fact that the protein encoded by the or/3*c gene exhibits strong similarity with 
other proteins of similar function in other organisms (cf. table 4). 



Table 4 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Glycosyltransferase (S. venezuelae) 


AAC68677 


426 


Glycosyltransferase 


Glycosyltransferase (S. antibioticus) 


CAA05642 


425 


Glycosyltransferase 


Glycosyltransferase 
{Saccharopolyspora erythraea) 


CAA74710 


395 


Glycosyltransferase 


Glycosyltransferase (5. antibioticus) 


CAA05641 


394 


Glycosyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 



The orf4*c gene encodes a protein exhibiting relatively strong similarity with 
several crotonyl-CoA reductases. In particular, the protein encoded by or/4*c has 
considerable similarity with a crotonyl-CoA reductase from Streptomyces coelicolor 
15 (M. Redenbach et aL, 1996; GenBank accession number: NP_630556; BLAST score: 
772). This similarity with a protein involved in the biosynthetic pathway for another 
close antibiotic strongly suggests that the orf4*c gene also encodes a crotonyl-CoA 
reductase. This hypothesis is supported by the fact that the protein encoded by the orf4*c 



32 



gene exhibits strong similarity with other proteins of similar function in other organisms 
(cf. table 5). 

Table 5 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


trans-2-enoyl-CoA reductase 
(EC 1.3. 1.3 8) (S. collinus) 


S72400 


764 


trans-2-enoyl-CoA 
reductase 


Crotonyl-CoA reductase (S.fradiae) 


CAA57474 


757 


Crotonyl-CoA reductase 


Crotonyl-CoA reductase 
(S. cinnamonensis) 


AAD53915 


747 


Crotonyl-CoA reductase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



5 a/., 1990). 

The or/6* gene exhibits a certain similarity with the mdmB gene present in 
Streptomyces mycarofaciens (Hara and Hutchinson, 1992; GenBank accession number: 
A42719; BLAST score: 489) which produces macrolide antibiotic. In this producer, the 
gene is involved in the acylation of the lactone ring. The orf6* gene is therefore thought 
10 to encode an acyltransferase. This hypothesis is supported by the fact that the protein 
encoded by the or/6* gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 6). 
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Table 6 



Protein exhibiting 
significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


A cyA (Streptomyces 
thermotolerans) 


J4001 


450 


macrolide 3 -O-acyltransferase 


Midecamycin 4"-0- 

propionyltransferase 

(5. mycarofaciens) 


BAA09815 


234 


Midecamycin 4"-0- 
propionyltransferase 


Mycarose O-acyltransferase 
(Micromonospora 
megalomicea subsp. Nigra) 


AAG13909 


189 


Mycarose O-acyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
a/., 1990). 



Inactivation of the orf6* gene was produced by an in-phase deletion/inversion and 
5 - it showed that the resulting strain no longer produces spiramycin II and III but only 
spiramycin I (cf. figure 1). This confirms that the or/6* gene is indeed involved in the 
synthesis of spiramycin II and III. The enzyme encoded by this gene is responsible for 
the formation of spiramycin II and HI by attachment of an acetyl or butyryl group to the 
carbon in the 3-position. The strains which no longer express the protein encoded by the 
10 or/6* gene are particularly advantageous since they no longer produce spiramycin II and 
III but only spiramycin I. As specified above, the antibiotic activity of spiramycin I is 
clearly greater than that of spiramycins II and III (Liu et al. 9 1999). 

The or/5* gene encodes a protein exhibiting relatively strong similarity with 
several O-methyltransferases. In particular, the protein encoded by orf5* has 
15 considerable similarity with an O-methyltransferase (EC 2.1.1.-) MdmC from 
Streptomyces mycarofaciens (Hara & Hutchinson, 1992; GenBank accession number: 
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B42719; BLAST score: 355). This similarity with a protein involved in the biosynthetic 
pathway for another antibiotic strongly suggests that the orfS* gene also encodes an 
O-methyltransferase. The or/3* gene is thought to be involved in the formation of 
precursors incorporated into the lactone ring. In fact, according to the sequence 
comparisons, the product of the or/5* gene is also relatively close to FkbG, which is 
responsible for the methylation of hydroxymalonyl-ACP according to (Wu et aL 9 2000; 
Hoffmeister et a/., 2000; GenBank accession number: AAF86386; BLAST score: 247) 
(cf. figure 8). This hypothesis is supported by the fact that the protein encoded by the 
or/5* gene exhibits strong similarity with other proteins of similar function in other 
organisms (cf. table 7). 



Table 7 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Probable O-methyltransferase 

(EC 2.1 .1 .-) safC (Myxococcus xanthus) 


T18553 


223 


O-methyltransferase 


4-O-methyltransferase (EC 2.1.1.-)- 
(Streptomyces sp.) 


JC4004 


222 


O-methyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL 9 1990). 



Due to the polar effect of the insertion of a nonexcised cassette into the orf 6* 
gene, it has been possible to determine that the orf 5* gene is essential to the 
biosynthetic pathway for spiramycins. Specifically, insertion of the excisable cassette 
into the coding portion of the orf 6* gene leads to a complete arrest of spiramycin 
production. However, once the inserted cassette has been excised (and therefore when 
only the orf6* gene is inactivated, cf. examples 14 and 15), production of spiramycin I is 
observed again. This shows that the orf 5* gene is essential to the biosynthetic pathway 
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for spiramycins since its inactivation leads to a complete arrest of spiramycin 
production. 

The orf5* gene encodes a protein exhibiting relatively strong similarity with 
several O-methyltransferases. The orf5* gene is thought to be an O-methyltransferase 
involved either directly in the synthesis of the platenolide, or in the synthesis of a 
methylated precursor (methoxymalonyl, see figure 8) incorporated into the platenolide 
by PKS. To verify this hypothesis, LC/SM and NMR analytical experiments were 
carried out on a strain of S. ambofaciens of genotype : orf6*::attlQhyg+. In this strain, 
the orf5* gene is not expressed, owing to the polar effect of the insertion, into the orf6* 
gene, of the cassette which contains transcription terminators (cf. example 27). It has 
been shown that this strain produces a molecule for which the UV spectrum has an 
appearance similar to that of spiramycin I, but the mass spectrum shows a molecular ion 
at 829. The difference in mass of 14 compared to the mass of spiramycin can be 
explained by the absence of methyl on the oxygen borne by carbon No. 4 of the lactone 
ring (the structure of this compound is given in figure 39). The results obtained by 
NMR are compatible with this hypothesis. The presence of a compound at 829 makes it 
possible to validate the hypothesis of the role of orf5* in spiramycin biosynthesis. In 
addition, the product corresponding to spiramycin without the methyl group exhibits 
very weak microbiological activity (10-fold weaker) compared with the unmodified 
spiramycin, when tested on the microorganism Micrococcus luteus. 

The or/7*c gene encodes a protein exhibiting relatively strong similarity with a 
protein encoded by the mdmA of Streptomyces mycarofaciens, the latter gene encoding a 
protein involved in midecamycin resistance in the producer enzyme (Hara et al. 9 1990; 
GenBank accession number: A60725; BLAST score: 380). This similarity with the 
protein involved in the biosynthetic pathway for another antibiotic strongly suggests that 
the or/7*c gene also encodes a protein involved in spiramycin resistance. More 
particularly, the enzyme encoded by the or/7*c gene has methyltransferase activity and 
is involved in spiramycin resistance in Streptomyces ambofaciens. It has been 
demonstrated that this gene imparts resistance of the MLS I type, which resistance is 
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known to be due to monomethylation, at position A2058 of 23S ribosomal RNA 
(Pernodet et al. 9 1996). This hypothesis is supported by the fact that the protein encoded 
by the or/7*c gene exhibits strong similarity with other proteins of similar function in 
other organisms (cf. table 8). 



Table 8 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


macrolide-lincosamide-streptogramin B 
resistance determinant (S. fradiae) 


JC5319 


238 


23S rRNA 
methyltransferase 


23 S ribosomal RNA methyltransferase 
ErmML {Micrococcus luteus) 


AAL68827 


119 


23 S rRNA 
methyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al y 1990). 



The or/8* gene encodes a protein exhibiting relatively strong similarity with a 
protein of the ABC transporter type in Streptomyces griseus (Campelo, 2002, GenBank 
accession number: CAC221 19; BLAST score: 191). This similarity with a protein of the 
ABC transporter type strongly suggests that the or/8* gene also encodes a protein of the 
ABC transporter type which may be involved in spiramycin resistance. This hypothesis 
is supported by the fact that the protein encoded by the or/8* gene exhibits strong 
similarity with other proteins of similar function in other organisms (cf. table 9). 
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Table 9 



Protein exhibiting significant similaritv 


number 


score 


iveponeu 
function 


AcrW (Streptomyces galilaeus) 


BAB72060 


94 


ABC 

transporter 


Daunorubicin resistance transmembrane 
protein (Streptomyces peucetius) 


P32011 


89 


daunorubicin 
resistance 


Probable ABC-transporter, 
transmembrane component. 
(Streptomyces coelicolor) 


NP_626506 


89 


ABC 

transporter 



* greater sequence similarity is associated with a higher BLAST score (Altschul et aL, 
1990). 



The or/9* gene encodes a protein exhibiting relatively strong similarity with a 
5 protein of the ABC transporter type in Streptomyces griseus (Campelo, 2002, GenBank 
accession number: CAC221 18; BLAST score: 269). This similarity with a protein of the 
ABC transporter type strongly suggests that the or/9* gene also encodes a protein of the 
ABC transporter type which may be involved in spiramycin resistance. This hypothesis 
is supported by the fact that the protein encoded by the or/9* gene exhibits strong 
10 similarity with other proteins of similar function in other organisms (cf. table 10). 
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Table 10 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported 
function 


Probable ABC-type transport protein, ATP- 
binding component (S. coelicolor) 


NP_626505 


231 


ABC 

transporter 


Putative ABC transporter ATP-binding 
component (Streptomyces Coelicolor) 


NP_627624 


228 


ABC 

transporter 



* greater sequence similarity is associated with a higher BLAST score (Altschul et al, 
1990). 



The orflO* gene encodes a protein exhibiting relatively strong similarity with a 
5 protein of unknown function. However, genes similar to orflO* are found in the middle 
of several groups of genes involved in the biosynthesis of antibiotics. Thus, a gene close 
to orflO* is found in S. coelicolor (Redenbach et aL 9 1996, GenBank accession number: 
NP_627432, BLAST score: 109). A close gene (CouY) is also found in 5. rishiriensis 
(Wang et aL 9 2000, GenBank accession number: AAG29779, BLAST score 97). 

10 

Genes located upstream of the genes encoding PKSs 

In the DNA sequence located upstream of the genes encoding PKSs (downstream 
and upstream being defined by the orientation of the 5 PKS genes all oriented in the 
same direction) (cf. figure 3), 34 ORFs have been identified (cf. above). Thus, the 34 

15 open reading frames of this type occupy a region of approximately 41 .7 kb (cf. SEQ ID 
No. 1 exhibiting a first region of 31 kb containing 25 ORFs and SEQ ID No. 140 
exhibiting a region of approximately 12.1 kb, 1.4 kb of which overlap the preceding 
sequence (SEQ ID No. 1) and approximately 10.7 kb of which correspond to the 
subsequent sequence), the latter portion of approximately 10.7 kb containing 9 

20 additional ORFs (including an ORF partial sequence), cf. also figures 3 and 37 below). 
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A diagrammatic representation of the organization of the region is given in figures 5 and 
37. The 34 genes identified were named: orfl, orf2, or/3, or/4, orf5, orf6, or/7 f orf8 f 
or/9c, orflO, orfllc, orfl 2, orfl 3c, orfl 4, orfl 5c, orfl 6, orfl 7, orfl 8, orfl 9, orf20, 
orf21c, orf22c, orf23c t orf24c f orf25c, orf26, orf27 t orf28c, or/29, orf30c t orf31 f orf32c t 
5 or/33 and orf34c. 

Given in table 1 1 below are the references to the DNA and amino acid sequence of 
the 34 genes identified upstream of the 5 PKS genes. 



Table 1 1 



Gene 1 


Position in the senuence 
SEQ ID No. 1 


A cpnnpnrp 




orfl 


658 to 1869 


SEQ ID No. 23 


SEQ ID No. 24 


orfl 


1866 to 2405 


SEQ ID No. 25 


SEQ ID No. 26 and 27 


orfl 


2402 to 3568 


SEQ ID No. 28 


SEQ ID No. 29 


orf4 


3565 to 4473 


SEQ ID No. 30 


SEQ ID No. 31, 32 and 33 


orfl 


4457 to 5494 


SEQ ID No. 34 


SEQ ID No. 35 


orf6 


5491 to 6294 


SEQ ID No. 36 


SEQ ID No. 37, 38 and 39 


orfl 


6296 to 7705 


SEQ ID No. 40 


SEQ ID No. 41 and 42 


or/8 


801 1 to 9258 


SEQ ID No. 43 


SEQ ID No. 44 


orflc 


10081 to 9362 


SEQ ID No. 45 


SEQ ID No. 46 


orflO 


10656 to 12623 


SEQ ID No. 47 


SEQ ID No. 48 


orfllc 


14482 to 12734 


SEQ ID No. 49 


SEQ ID No. 50, 51 and 52 


orfll 


14601 to 16031 


SEQ ID No. 53 


SEQ ID No. 54, 55, 56, 57, 58 and 59 


orfl 3c 


17489 to 16092 


SEQ ID No. 60 


SEQ ID No. 61 


orfl4 


17809 to 18852 


SEQ ID No. 62 


SEQ ID No. 63 


orflSc 


20001 to 18961 


SEQ ID No. 64 


SEQ ID No. 65 
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orfl6 


20314 to 21552 


SEQ ID No. 66 


SEQ ID No. 67 


orfl7 


21609 to 22879 


SEQ ID No. 68 


SEQ ID No. 69 


orfl8 


22997 to 24175 


SEQ ID No. 70 


SEQ ID No. 71 


or/19 


24177 to 25169 


SEQ ID No. 72 


SEQ ID No. 73 


or/20 


25166 to 26173 


SEQ ID No. 74 


SEQ ID No. 75 


orplc 


27448 to 26216 


SEQ ID No. 76 


SEQ ID No. 77 


orf22c 


28560 to 27445 


SEQ ID No. 78 


SEQ ID No. 79 


orf23c 


29770 to 28649 


SEQ ID No. 80 


SEQ ID No. 81 


orf24c 


30074 to 29763 


SEQ ID No. 82 


SEQ ID No. 83 


orf25c 


30937 to 30071 


SEQ ID No. 84 


SEQ ID No. 85 


Gene 1 


Position in the sequence 
SEQ ID No. 1 


DNA sequence 


Polypeptide sequence(s) 2 


or/26 


1647 to 2864 


SEQ ID No. 107 


SEQ ID No. 108 


or/27 


2914 to 3534 


SEQ ID No. 109 


SEQ ID No. 110 


orf28c 


4967 to 3804 


SEQ ID No. 141 


SEQ ID No. 142 


or/29 


5656 to 6663 


SEQ ID No. 113 


SEQ ID No. 114 


or/30c 


7723 to 6686 
7534 to 6686 


SEQ ID No. 115 
SEQ ID No. 143 


SEQ ID No. 116 and 117 
SEQ ID No. 144 


or/31 


7754 to 8728 


SEQ ID No. 118 


SEQ ID No. 119 


or/32c 


10488 to 8977 


SEQ ID No. 145 


SEQ ID No. 146 


or/33 


10562 to 10837 


SEQ ID No. 147 


SEQ ID No. 148 


orf34c 


12134 to 10899 


SEQ ID No. 149 


SEQ ID No. 150 



The "c" added to the name of the gene indicates that the coding sequence is in the 
reverse orientation (the coding strand is therefore the strand complementary to the 
sequence given in SEQ ID No. 1 or SEQ ID No. 140 for these genes). 
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When several protein sequences are indicated for a single orf, the corresponding 
proteins are derived from several possible translation initiation sites. 

Three types of experiment were carried out with the aim of determining the 
5 function of the polypeptides identified in table 1 1 above: comparison of the identity of 
the identified sequences with sequences of known functions, gene inactivation 
experiments, and analyses of spiramycin production by these mutant strains. 

The protein sequences deduced from these open reading frames were all first of all 
compared with those present in various databases using various programs: BLAST 

10 (Altschul et aL, 1990) (Altschul et aL, 1997), CD-search, COGs (Cluster of Orthologous 
Groups), FASTA ((W.R. Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 1990), 
BEAUTY (K.C. Worley et aL, 1995)), (cf. above). These comparisons made it possible 
to formulate hypotheses regarding the function of the products of these genes and to 
identify those liable to be involved in spiramycin biosynthesis. Table 12 shows the 

15 proteins exhibiting strong similarity with the 34 genes located upstream of the 5 PKF 
genes. 



Table 12 



Gene 


Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST 
* score 


Reported function 


orfl 


Cytochrome P450 tyll {Streptomyces 
fradiae) 


S49051 


530 


Cytochrome P450 


orf2 


ORF 15x4 (Lis tone I la anguillarum) 


AAB81630 


113 


unknown 


or/3 


aminotransferase-like protein 
(Streptomyces antibioticus) 


AAF59939 


431 


Amino-transferase 


or/4 


alpha-D-glucose-1 -phosphate 
thymidylyltransferase (Streptomyces 
venezuelae) 


AAC68682 


404 


alpha-D-glucose- 
1 -phosphate 
thymidylyl- 
transferase 
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orfS 


AprE (Streptomyces tenebrarius) 


AAG 18457 


476 


dTDP-glucose 
4,6-dehydratase 


Of JO 


i iiiucbLcicibc yoirepiomyces avermitiiis) 


bADoyj 1 5 


234 


Thioesterase 


or/7 


TylCVI {Streptomyces fradiae) 


AAF29379 


461 


dNTP hexose 
2,3 -dehydratase 


or/8 


probable aminotransferase 
y&accnaropoiyspora spinosa) 


AAG23279 


465 


Aminotransferase 


or/9c 


SrmX {Streptomyces ambofaciens) 


S25204 


445 


Methyltransferase 


orflO 


SrmR {Streptomyces ambofaciens) 


S25203 


1074 


Regulatory 
protein 


orfllc 


SrmB {Streptomyces ambofaciens) 


S25202 


955 


Spiramycin 
resistance 


orfl2 


UrdQ {Streptomyces fradiae) 


AAF72550 


634 


NDP-hexose 
3,4-dehydratase 


orflSc 


SC4H2.17 {Streptomyces coelicolor) 


T35116 


619 


Unknown 


or/14 


putative reductase {Streptomyces 
coelicolor) 


CAB90862 


147 


Reductase 


orfl5c 


Probable 3-keto reductase 
{Streptomyces antibioticus) 


T51102 


285 


3-keto reductase 


or/16 


Hypothetical NDP hexose 3,4- 
isomerase {Streptomyces fradiae) 


CAA57471 


209 


NDP hexose 
3,4-isomerase 


or/77 


Glycosyl transferase {Streptomyces 
venezuelae) 


AAC68677 


400 


Glycosyl- 
transferase 


or/18 


Glycosyltransferase {Streptomyces 
rishiriensis) 


AAG29785 


185 


Glycosyl- 
transferase ! 


or/19 


NDP-hexose 4-keto reductase TylCIV 
{Streptomyces fradiae) 


AAD41822 


266 


NDP-hexose 
4-keto reductase 


or/20 


EryBII {Saccharopolyspora erythraea) 


AAB84068 


491 


aldo-keto 
reductase 
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orf21c 


TylCIII (Streptomyces fradiae) 


AAD41823 


669 


NDP-hexose 3- 

C-methyl- 

transierase 


orf22c 


FkbH {Streptomyces hygroscopicus) 


AAF86387 


463 


Involved in 

methoxymalonyl 

biosynthesis 


orf23c 


Fkbl (Streptomyces hygroscopicus) 


AAF86388 


387 


Acyl-CoA 
dehydrogenase 


orf24c 


FkbJ (Streptomyces hygroscopicus) 


AAF86389 


87 


Involved in 

methoxymalonyl 

biosynthesis 


OrJZJC 


FkbK (Streptomyces hygroscopicus) 




268 


Acyl-CoA 
dehydrogenase 


orjzo 


iyiL,v (otreptomyces jradiae) 


A A T\A 1 O A 

AAD41 824 


471 


Mycarosyl- 
transferase 


orjz/ 


lyicvii (Streptomyces jraaiae) 


AAD41o25 


243 


NDP-hexose 3,5- 
(or 5-) epimerase 


orf28c 


AcyB2 (Streptomyces thermotolerans) 


JC2032 


329 


regulatory 
protein 


orf29 


Beta-man annase (Sorangium 
ceuuiosum ) 


AAK 19890 


139 


Glycosyl- 
hydrolase 


orftOc 


Nucleoside-diphosphate-sugar 
cpimerctbe \\^oryrieoucierium 
glutamicum) 


NP_600590 


89 


Nucleoside- 
diphosphate- 
sugar epimerase 


OrjDl 


vjAjQorcuuciase y&irepiomyces 
coelicolor) 


XTT) All 1 A Q 

iNr_o;> I 145 


2ol 


Oxidoreductase 


orJ32c 


Reculatorv nrotein of the GntR familv 
(Streptomyces avermitilis) 


NP 824604 




protein 


or/33 


Hypothetical protein 
(Xanthomonas campestris) 


NP_635564 


54 


Unknown 


orf34c 


Arabinofuranosidase 
(Streptomyces coelicolor) 


NP_630049 


654 


Arabino- 
furanosidase 
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* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 

Gene inactivation experiments were carried out in order to confirm these results. 
The methods used consist in performing a gene replacement. The target gene to be 
5 interrupted is replaced with a copy of this gene interrupted with a cassette imparting 
resistance to an antibiotic (for example apramycin or hygromycin). The cassettes used 
are bordered on either side by translation termination codons in all reading frames and 
by transcription terminators which are active in Streptomyces. Insertion of the cassette 
into the target gene may or may not be accompanied by deletion in this target gene. The 
10 size of the regions flanking the cassette may range from a few hundred to several 
thousand base pairs. A second type of cassette may be used for the gene inactivation: 
cassettes termed "excisable cassettes" (cf. above). The strains thus constructed were 
tested for their spiramycin production. 

The orfl gene encodes a protein exhibiting relatively strong similarity with several 
15 cytochrome P450s. In particular, the protein encodied by orfl has considerable similarity 
with the protein encoded by the tyll gene involved in the biosynthesis of tylosine in 
Streptomyces fradiae (L.A. Merson-Davies et al, 1994; GenBank accession number: 
S49051; BLAST score: 530). This similarity with a protein involved in the biosynthetic 
pathway for another close antibiotic strongly suggests that the orfl gene also encodes a 
20 cytochrome P450. This hypothesis is supported by the fact that the protein encoded by 
the orfl gene exhibits strong similarity with other proteins of similar function in other 
organisms (cf. table 13). 



Table 13 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Putative cytochrome P450 YJIB 
{Bacillus subtilis) 


Q34374 


248 


Cytochrome P450 
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Cytochrome P450 113A1 
(Saccharopolyspora erythraed) 


P48635 


237 


Cytochrome P450 


Cytochrome P-450 hydroxylase 
homolog {Streptomyces caelestis) 


AAC46023 


208 


Cytochrome P450 
hydroxylase 


Cytochrome P450 monooxygenase 
{Streptomyces venezuelae) 


AAC64105 


206 


Cytochrome P450 
monooxygenase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 



The or/2 gene encodes a protein exhibiting relatively strong similarity with a 
dTDP-6-deoxy-3,4-ketohexulose isomerase of Aneurinibacillus thermoaerophilus 
5 (Pfoestl, A. et al, 2003, GenBank accession number: AAO06351; BLAST score: 118). 
This similarity strongly suggests that the or/2 gene encodes an isomerase responsible for 
the isomerization reaction required for the biosynthesis of one of the sugars present in 
the spiramycin molecule, this sugar possibly being mycarose (cf figure 5). Inactivation 
of the or/2 gene was performed. It could be shown that the resulting strain no longer 
10 " produces spiramycins. This confirms that the or/2 gene is indeed involved in spiramycin 
biosynthesis. 

The or/3 gene encodes a protein exhibiting relatively strong similarity with several 
aminotransferases. In particular, the protein encoded by or/3 has considerable similarity 
with an aminotransferase of Streptomyces antibioticus involved in oleandomycin 

15 biosynthesis (G. Draeger et aL, 1999; GenBank accession number: AAF59939; BLAST 
score: 431). This similarity with a protein involved in the biosynthetic pathway for 
another close antibiotic strongly suggests that the or/3 gene encodes a 
3 -aminotransferase responsible for the transamination reaction necessary for the 
biosynthesis of one of the amino sugars of spiramycins (cf. figure 5). This hypothesis is 

20 supported by the fact that the protein encoded by the or/3 gene exhibits strong similarity 
with other proteins of similar function in other organisms (cf. table 15). 
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Table 15 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Aminotransferase 
{Streptomyces antibioticus) 


T51111 


429 


Aminotransferase 


Transaminase 
{Streptomyces venezuelae) 


AAC68680 


419 


Transaminase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 



The or/3 gene was inactivated. It was thus possible to show that the resulting strain 
no longer produces spiramycins. This confirms that the or/3 gene is indeed involved in 
spiramycin biosynthesis. The enzyme encoded by this gene is therefore clearly 
responsible for a byconversion step essential to spiramycin biosynthesis. Spiramycin 
production can be complemented by the expression of the TylB protein of 5. fradiae 
(cf. example 23). This demonstrates that the or/3 gene encodes a 3-aminotransferase 
responsible for the transamination reaction necessary for mycaminose biosynthesis 
(cf. figure 5). Since mycaminose is the first sugar to be attached to platenolide, the strain 
with the or/3 knockout (OS49.67) is expected to accumulate platenolide. 

The biosynthesis intermediates of the strain with the or/3 gene knockout were 
studied (cf. example 20). These experiments made it possible to demonstrate that this 
strain produces two forms of platenolide: platenolide A and platenolide B, the deduced 
structure of these two molecules is given in figure 36. This strain also produces 
platenolide A + mycarose and platenolide B + mycarose (cf. example 20 and figure 40). 
These compounds comprise a sugar but do not comprise any mycaminose. In addition, if 
they are compared with spiramycin (cf. figure 1), these compounds comprise mycarose 
in place of mycaminose. These results are in agreement with the product of the or/3 gene 
being involved in mycaminose biosynthesis and it having a role as a 3 aminotransferase 
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responsible for the transamination reaction required for mycaminose biosynthesis 
(cf. figure 5). It may be noted that the specificity of glycosylation does not appear to be 
absolute since molecules with mycarose attached at the position normally occupied by 
mycaminose are found (cf. figure 40). 

5 The or/4 gene encodes a protein exhibiting relatively strong similarity with several 

NDP-glucose synthetases. In particular, the protein encoded by orf4 has considerable 
similarity with an alpha-D-glucose-1 -phosphate thymidylyltransferase of Streptomyces 
venezuelae (Y. Xue et aL, 1998; GenBank accession number: AAC68682; BLAST 
score: 404). This similarity with a protein involved in the biosynthetic pathway for 

10 another close antibiotic strongly suggests that the orf4 gene encodes an NDP-glucose 
synthetase responsible for the synthesis of NDP-glucose necessary for the biosynthesis 
of the three atypical sugars incorporated into the spiramycin molecule (cf. figures 4, 5 
and 6). This hypothesis is supported by the fact that the protein encoded by the orf4 gene 
exhibits strong similarity with other proteins of similar function in other organisms 

15 (cf. table 16). 



Table 16 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Glucose- 1 -phosphate 
thymidyltransferase 
{Streptomyces avermitilis) 


BAA84594 


402 


Glucose- 1- 

phosphate 

thymidyltransferase 


Acl Y {Streptomyces galilaeus) 


BAB72036 


400 


dTDP-1 -glucose 
synthetase 


Putative glucose- 1 -phosphate 
thymidyltransferase 
{Saccharopolyspora spinosa) 


AAK83289 


399 


glucose- 1- 
phosphate 
thymidyltransferase 
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* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 

The orf5 gene encodes a protein exhibiting relatively strong similarity with several 
glucose dehydratases. In particular, the protein encoded by or/5 has considerable 
5 similarity with a dTDP-glucose 4,6-dehydratase of Streptomyces tenebrarius (T.B. Li et 
al, 2001; GenBank accession number: AAG18457, BLAST score: 476). This similarity 
with a protein involved in the biosynthetic pathway for another close antibiotic strongly 
suggests that the orf5 gene encodes an NDP-glucose dehydratase necessary for the 
biosynthesis of the three atypical sugars incorporated into the spiramycin molecule 
10 (cf. figures 4, 5 and 6). This hypothesis is supported by the fact that the protein encoded 
by the or/5 gene exhibits strong similarity with other proteins of similar function in other 
organisms (cf. table 1 7). 



Table 17 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


dTDP-glucose 4,6-dehydratase 
{Saccharopolyspora spinosa) 


AAK83290 


464 


dTDP-glucose 
4,6-dehydratase 


thymidine diphosphoglucose 
4,6-dehydratase 
{Saccharopolyspora erythraea) 


AAA68211 


445 


thymidine 

diphosphoglucose 

4,6-dehydratase 


dTDP-glucose 4,6-dehydratase 

(EC 4.2.1.46) - {Streptomyces fradiae) 


S49054 


443 


dTDP-glucose 
4,6-dehydratase 


TDP-glucose 4,6-dehydratase 
{Streptomyces venezuelae) 


AAC68681 


421 


TDP-glucose 
4,6-dehydratase 
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SgcA {Streptomyces globisporus) 


AAF13998 


418 


dNDP-glucose 








4,6-dehydratase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 



The orf6 gene encodes a protein exhibiting relatively strong similarity with several 
thioesterases. In particular, the protein encoded by orf6 has considerable similarity with 
5 a thioesterase of Streptomyces avermitilis (S. Omura et al, 2001; GenBank accession 
number: BAB69315; BLAST score: 234). This similarity with a protein involved in the 
biosynthetic pathway for another close antibiotic strongly suggests that the orf6 gene 
encodes a thioesterase. This hypothesis is supported by the fact that the protein encoded 
by the orf6 gene exhibits strong similarity with other proteins of similar function in other 
10 organisms (cf. table 18). 



Table 18 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


RifR {Amycolatopsis mediterranei) 


AAG52991 


216 


Thioesterase 


Thioesterase - {Streptomyces fradiae) 


S49055 


215 


Thioesterase 


Thioesterase 

{Streptomyces avermitilis) 


BAB69188 


213 


Thioesterase 


Thioesterase II (EC 3.1.2.-) - 
{Streptomyces venezuelae) 


T17413 


203 


Thioesterase 


PimI protein {Streptomyces ndtalensis) 


CAC20922 


201 


Thioesterase 


Thioesterase {Streptomyces griseus) 


CAC22116 


200 


Thioesterase 



50 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al, 1990). 

The or/7 gene encodes a protein exhibiting relatively strong similarity with several 
hexose dehydratases. In particular, the protein encoded by or/7 has considerable 
similarity with a dNTP-hexose 2,3-dehydratase (encoded by the TylCVI gene) of 
Streptomyces fradiae involved in tylosine biosynthesis (L.A. Merson-Davies et al, 
1994; GenBank accession number: AAF29379; BLAST score: 461). This similarity with 
a protein involved in the biosynthetic pathway for another close antibiotic strongly 
suggests that the or/7 gene also encodes a hexose 2-3-dehydratase necessary for the 
biosynthesis of two atypical sugars incorporated into the spiramycin molecule 
(cf. figures 4 and 6). This hypothesis is supported by the fact that the protein encoded by 
the or/7 gene exhibits strong similarity with other proteins of similar function in other 
organisms (cf. table 19). 



Table 19 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Rifl 8 {Amycolatopsis mediterranei) 


AAG52988 


459 


Hexose dehydratase 


SimB3 {Streptomyces antibioticus) 


AAK06810 


444 


dNDP-4-keto-6-deoxy- 
glucose 2,3-dehydratase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al, 1990). 



The or/8 gene encodes a protein exhibiting relatively strong similarity with several 
aminotransferases. In particular, the protein encoded by or/8 has considerable similarity 
with an aminotransferase probably involved in forosamine biosynthesis in 
Saccharopolyspora spinosa (C. Waldron et al, 2001; GenBank accession number: 
AAG23279; BLAST score: 465). This similarity with a protein involved in the 
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biosynthetic pathway for another close antibiotic strongly suggests that the or/8 gene 
encodes a 4-aminotransferase responsible for the transamination reaction necessary for 
forosamine biosynthesis (cf. figure 6). This hypothesis is supported by the fact that the 
protein encoded by the or/8 gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 20). 



Table 20 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Putative amino sugar biosynthesis 
protein (Bordetella bronchiseptica) 


CAA07666 


213 


Protein involved in amino 
sugar biosynthesis 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al, 1990). 



The or/8 gene was inactivated. It was thus possible to show that the resulting strain 
no longer produces spiramycins. This confirms that the or/8 gene is indeed involved in 
spiramycin biosynthesis. The enzyme encoded by this gene is therefore clearly 
responsible for a byconversion step essential to spiramycin biosynthesis. Validation of 
the hypothesis of the role played by the product of the or/8 gene in forosamine 
biosynthesis is provided by the fact that an inactivated mutant for the or/8 gene produces 
forocidin, this mutant is therefore blocked at the forocidin stage and does not produce 
any neo-spiramycin (cf. figure 7 and example 25). These results are in agreement with 
the product of the or/8 gene being involved in forosamine biosynthesis (cf. figure 6). 

The or/9c gene has already been identified in Streptomyces ambofaciens and has 
been named srmX by Geistlich et al. (M. Geistlich et al., 1992). The similarity of the 
protein encoded by this gene with several methyltransferases involved in the 
biosynthetic pathway for other close antibiotics strongly suggests that the or9c gene 
encodes a methyltransferase responsible for the methylation reaction necessary for 
mycaminose or forosamine biosynthesis (cf. figures 5 and 6). This hypothesis is 
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supported by the fact that the protein encoded by the or/9c gene exhibits strong 
similarity with other proteins of similar function in other organisms (cf. table 21). 

Table 21 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


N,N-dimethyltransferase 
{Streptomyces venezuelae) 


AAC68678 


240 


N,N-dimethyl- 
transferase 


Methyltransferase 
{Streptomyces antibioticus) 


CAA05643 


232 


Methyltransferase 


Putative aminomethylase 
{Streptomyces nogalater) 


AAF01819 


219 


Aminomethylase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



5 al, 1990). 

The or/7 0 gene has already been identified in Streptomyces ambofaciens and has 
been named srmR by Geistlich et al (M. Geistlich et al, 1992). The protein encoded by 
this gene is involved in regulation of the biosynthetic pathway for spiramycins in 
Streptomyces ambofaciens. The or/10 gene was inactivated. It was thus possible to show 
10 that the resulting strain no longer produces spiramycins. This confirms that the or/10 
gene is indeed involved in spiramycin bisynthesis. The protein encoded by this gene is 
therefore clearly essential for spiramycin biosynthesis. 

In addition, the translation initiation point of or/10 was determined and it was 
possible to show that overexpression of this gene leads to an improvement in the 
15 production of spiramycins. The translation initiation site corresponds to an ATG located 
upstream of the ATG proposed by Geistlich et al (M. Geistlich et al, 1992). It was also 
demonstrated that this 5' end is essential to the function of OrflO since a 5'-truncated 
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messenger is inactive (cf. example 17). To obtain the desired effect on the production of 
spiramycins, it is therefore essential for the overexpression of orflO to be effected while 
taking care not to express a 5 '-truncated messenger oiorflO. 

The orfllc gene has already been identified in Streptomyces ambofaciens and has 
5 been named srmB by Geistlich et al (M. Geistlich et al, 1992) and Schoner et al 
(B. Schoner et al, 1992). The protein encoded by this gene is involved in spiramycin 
resistance in Streptomyces ambofaciens and is an ABC-type transporter. 

The or/12 gene encodes a protein exhibiting relatively strong similarity with 
several hexose dehydratases. In particular, the protein encoded by orfl2 has considerable 

10 similarity with an NDP-hexose 3,4-dehydratase encoded by the UrdQ gene of 
Streptomyces fradiae and involved in urdamycin biosynthesis (D. Hofimeister et al., 
2000; GenBank accession number: AAF72550; BLAST score: 634). This similarity with 
a protein involved in the biosynthetic pathway for another close antibiotic strongly 
suggests that the orfl2 gene encodes a 3,4 dehydratase responsible for the dehydration 

15 reaction necessary for forosamine biosynthesis (cf. figure 6). This hypothesis is 
supported by the fact that the protein encoded by the orfl2 gene exhibits strong 
similarity with other proteins of similar function in other organisms (cf. table 22). 



Table 22 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


AknP (Streptomyces galilaeus) 


AAF73452 


625 


3 -dehydratase 


NDP-hexose 3,4-dehydratase homolog 
(Streptomyces cyanogenus) 


AAD13547 


624 


NDP-hexose 
3,4-dehydratase 


Rdml (Streptomyces purpurascens) 


AAL24451 


608 


hexose-C-3-dehydratase 
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Probable CDP-4-keto-6-deoxyglucose 
3-dehydratase (El) 
{Streptomyces violaceoruber) 


T46528 


602 


CDP-4-keto-6-deoxy- 
glucose 3-dehydratase 


Probable NDP-hexose 3,4-dehydratase 
(Saccharopolyspora spinosa) 


AAG23278 


582 


NDP-hexose 
3,4-dehydratase 


dNTP-hexose dehydratase 
(Amycolatopsis mediterranei) 


AAC01730 


576 


dNTP-hexose 
dehydratase i 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



aL, 1990). 

The orfl2 gene was inactivated. It was possible to show that the resulting strain no 
longer produces spiramycins. This confirms that the orfl2 gene is indeed involved in 
5 spiramycin biosynthesis. The enzyme encoded by this gene is therefore clearly 
responsible for a bioconversion step essential to spiramycin biosynthesis. Validation of 
the hypothesis of the role played by Orfl2 in forosamine biosynthesis is provided by the 
fact that an inactivated mutant for the orfl2 gene no longer produces forosamine. 
However, it produces a small amount of forocidin. This mutant is therefore blocked at 

10 the forocidin stage and does not produce any neo-spiramycin (cf. figure 7 and 
example 26). This mutant also produces a compound having the structure shown in 
figure 38. The latter compound contains two sugars, mycaminose and mycarose, but 
does not contain any forosamine. In addition, if it is compared with the structure of 
spiramycin (cf. figure 1), this compound contains the sugar mycarose in the expected 

15 place of forosamine. These results are in agreement with the product of the or/72 gene 
being involved in forosamine biosynthesis (cf. figure 6). It may be noted that the 
specificity of glycosylation is not absolute since molecules in which mycarose is 
attached at the position normally occupied by forosamine are observed (see figure 38). 

The orf!3c gene encodes a protein exhibiting relatively strong similarity with a 
20 protein of unknown function in Streptomyces coelicolor. This protein was named 
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SC4H2.17 (GeneBank accession number: T35116; BLAST score: 619). The protein 
encoded by the orfl3c gene also exhibits strong similarity with other proteins of other 
organisms (cf. table 23). 



Table 23 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


hflX protein (Mycobacterium leprae) 


S72938 


473 


unknown 


Possible ATP/GTP-binding protein 
(Mycobacterium leprae) 


NP_301739 


470 


ATP/GTP-binding 
protein 


GTP-binding protein (Mycobacterium 
tuberculosis CDC1551) 


AAK47114 


468 


GTP-binding protein 


ATP/GTP-binding protein 
(Streptomyces fradiae) 


T44592 


388 


ATP/GTP-binding 
protein 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al., 1990). 



No precise function has been attributed to the proteins close to that encoded by 
orfl3c. The orfl3c gene was inactivated for the purpose of studying the function of this 
gene in the biosynthetic pathway for spiramycins in Streptomyces ambofaciens. It was 
possible to show that the resulting strain produces spiramycins. This indicates that the 
orfl3c gene is not essential to spiramycin biosynthesis and that it is not essential to the 
survival of the bacterium. The enzyme encoded by this gene is not therefore responsible 
for a biocon version step essential to spiramycin biosynthesis. 

The orfl4 gene encodes a protein exhibiting relatively strong similarity with a 
putative reductase (M. Redenbach et al, 1996; Bentley et al., 2002; GenBank accession 
number: CAB90862; BLAST score: 147). 
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The orfl4 gene was inactivated. It was thus possible to show that the resulting 
strain no longer produces spiramycins. This confirms that the orfl4 gene is indeed 
involved in spiramycin biosynthesis. The enzyme encoded by this gene is therefore 
clearly responsible for a byconversion step essential to spiramycin biosynthesis. The 
5 biosynthesis intermediates of the strain with the orfl4 gene knockout were studied 
(cf. example 20). These experiments made it possible to demonstrate that this strain 
produces platenolide A but does not produce platenolide B (cf. figure 36). 

The orfl 5c gene encodes a protein exhibiting relatively strong similarity with 
several keto reductases. In particular, the protein encoded by orfl 5c has considerable 

10 similarity with a 3-keto reductase in Streptomyces antibioticus (GenBank accession 
number: T51102, BLAST score: 285). This similarity strongly suggests that the orfl 5c 
gene encodes a 3-keto reductase responsible for the reduction reaction necessary for 
forosamine biosynthesis (cf. figure 6). This hypothesis is supported by the fact that the 
protein encoded by the orfl 5c gene exhibits strong similarity with other proteins of 

15 similar function in other organisms (cf. table 24). 



Table 24 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


oxidoreductase homolog 
{Streptomyces cyanogenus) 


AAD13550 


272 


Oxidoreductase 


D-oliose 4-keto reductase 
{Streptomyces argillaceus) 


CAB96550 


265 


D-oliose 4-keto 
reductase 


AknQ {Streptomyces galilaeus) 


AAF73453 


263 


putative 3-keto 
reductase 
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Probable NDP-hexose 3-keto reductase 


AAG23275 


253 


NDP-hexose 3-keto 


{Saccharopolyspora spinosa) 






reductase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al, 1990). 



The orfl6 gene encodes a protein exhibiting relatively strong similarity with 
several isomerases. In particular, the protein encoded by orfl6 has considerable 
similarity with an NDP-hexose 3,4-isomerase in Streptomyces fradiae (Gandecha et aL, 
1997 GenBank accession number: CAA57471, BLAST score: 209). This similarity 
strongly suggests that the orfl 6 gene encodes a protein involved in the biosynthesis of 
one of the sugars of spiramycin (cf. figures 5 and 6). This hypothesis is supported by the 
fact that the protein encoded by the orfl 6 gene exhibits strong similarity with other 
proteins of similar function in other organisms (cf. table 25). 



Table 25 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Putative tautomerase 
{Streptomyces venezuelae) 


AAC68676 


145 


Tautomerase 


TDP-4-keto-6-deoxyhexose 
3,4-isomerase {Micromonospora 
megalomicea subsp. nigra) 


AAG 13907 


112 


TDP-4-keto- 

6-deoxyhexose 

3,4-isomerase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
ai, 1990). 



The orfl 7 gene encodes a protein exhibiting relatively strong similarity with 
several glycosyltransferases. In particular, the protein encoded by orfl 7 has considerable 
similarity with a glycosyltransferase of Streptomyces venezuelae (Y. Xue et al, 1998; 
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GenBank accession number: AAC68677; BLAST score: 400). The similarity of the 
protein encoded by the orfl7 gene with several glycosyltransferases involved in the 
biosynthetic pathway for other close antibiotics strongly suggests that this gene also 
encodes a glycosyltransferase. This hypothesis is supported by the fact that the protein 
encoded by the orfl7 gene exhibits some similarity with other proteins of similar 
function in other organisms (cf. table 26). 



Table 26 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function " 


Glycosyltransferase 
{Streptomyces fradiae) 


CAA57472 


399 


Glycosyltransferase 


Glycosyltransferase 
{Streptomyces antibioticus) 


CAA05642 


378 


Glycosyltransferase 


Glycosyltransferase 
{Streptomyces antibioticus) 


CAA05641 


360 


Glycosyltransferase 


Glycosyltransferase 
{Saccharopolyspora erythraea) 


CAA74710 


344 


Glycosyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



aL, 1990). 

The or/18 gene encodes a protein exhibiting relatively strong similarity with 
several glycosyltransferases. In particular, the protein encoded by or/78 has considerable 
similarity with a glycosyltransferase of Streptomyces rishiriensis (Wang et al, 2000; 
GenBank accession number: AAG29785; BLAST score: 185). The similarity of the 
protein encoded by the or/18 gene with several glycosyltransferases involved in the 
biosynthetic pathway for other close antibiotics strongly suggests that this gene also 
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encodes a glycosyltransferase. This hypothesis is supported by the fact that the protein 
encoded by the or/18 gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 27). 



Table 27 



Protein exhibiting significant 
similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


NovM {Streptomyces spheroides) 


AAF67506 


184 


Glycosyltransferase 


probable glycosyltransferase 
{Streptomyces violaceoruber) 


T46519 


169 


Glycosyltransferase 


Glycosyltransferase homolog 
{Streptomyces cyanogenus) 


AAD13553 


167 


Glycosyltransferase 


Glycosyltransferase homolog 
{Streptomyces cyanogenus) 


AAD13555 


163 


Glycosyltransferase 


dNTP-hexose glycosyltransferase r 
{Amycolatopsis mediterranei) 


AAC01731 


160 


Glycosyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al. 9 1990). 



The orfl9 gene encodes a protein exhibiting relatively strong similarity with 
several keto reductases. In particular, the protein encoded by orfl9 has considerable 
similarity with an NDP-hexose 4-keto reductase (TylCIV) Streptomyces fradiae (Bate et 
al„ 2000; GenBank accession number: AAD41822; BLAST score: 266). The similarity 
of the protein encoded by the orfl9 gene with this keto reductase involved in the 
biosynthetic pathway for a close antibiotic strongly suggests that this gene also encodes 
a 4-keto reductase responsible for the reduction reaction necessary for mycarose 
biosynthesis (cf. figure 4). This hypothesis is supported by the fact that the protein 
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encoded by the orfl9 gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 28). 

Table 28 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


NDP-4-keto-6-deoxyhexose 4-keto 
reductase {Streptomyces venezuelae) 


AAL14256 


251 


NDP-4-keto-6- 
deoxyhexose 4-keto 
reductase 


EryBIV {Saccharopolyspora erythraea) 


AAB84071 


249 


oxidoreductase 


TDP-4-keto-6-deoxyhexose 4-keto 
reductase {Micromonospora 
megalomicea subsp. Nigra) 


AAG13916 


218 


TDP-4-keto-6- 
deoxyhexose 4-keto 
reductase 


dTDP-4-keto-6-deoxy-L-hexose 
4-reductase {Streptomyces avermitilis) 


BAA84595 


212 


dTDP-4-keto-6- 
deoxy-L-hexose 
4-reductase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



5 a/., 1990). 

The orf20 gene encodes a protein exhibiting relatively strong similarity with 
several hexose reductases. In particular, the protein encoded by orf20 has considerable 
similarity with the EryBII of Saccharopolyspora erythraea which encodes a dTDP-4- 
keto-L-6-deoxyhexose 2,3-reductase (R.G. Summers et aL, 1997), GenBank accession 
10 number: AAB84068; BLAST score: 491). The similarity of the protein encoded by the 
orf20c gene with several hexose reductases involved in the biosynthetic pathway for 
other close antibiotics strongly suggests that this gene encodes a 2,3-reductase 
responsible for the reduction necessary for mycarose biosynthesis (cf. figure 4). This 
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hypothesis is supported by the fact that the protein encoded by the orf20c gene exhibits 
strong similarity with other proteins of similar function in other organisms (cf. table 29). 

Table 29 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


TylCII (Streptomyces fradiae) 


AAD41821 


464 


NDP-hexose 2,3- 
enoyl reductase 


TDP-4-keto-6-deoxyhexose 
2,3-reductase (Micromonospora 
megalomicea subsp. Nigra) 


AAG13914 


446 


TDP-4-keto-6- 
deoxyhexose 2,3- 
reductase 


dTDP-4-keto-6-deoxy-L-hexose 

2,3-reductase 

(Streptomyces avermitilis) 


BAA84599 


377 


dTDP-4-keto-6- 
deoxy-L-hexose 2,3- 
reductase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



5 a/., 1990). 

The orfllc gene encodes a protein exhibiting relatively strong similarity with 
several hexose methyltransferases. In particular, the protein encoded by orfllc has 
considerable similarity with the TylCIII gene of Streptomyces fradiae which encodes an 
NDP-hexose 3-C-methyltransferase (N. Bate et al. y 2000; GenBank accession number: 

10 AAD41823; BLAST score: 669). The similarity of the protein encoded by the or/2 J c 
gene with several hexose methyltransferases involved in the biosynthetic pathway for 
other close antibiotics strongly suggests that this gene encodes a hexose 
methyltransferase responsible for the methylation reaction necessary for mycarose 
biosynthesis (cf. figure 4). This hypothesis is supported by the fact that the protein 

15 encoded by the or/2 J c gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 30). 
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Table 30 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


eryH {Saccharopolyspora erythraea) 


228448 


592 


Erythromycin 
biosynthesis gene 


S-adenosyl-dependent 
methyltransferase (Coxiella burnetii) 


AAK71270 


358 


Methyltransferase 


NovU (Streptomyces spheroides) 


AAF67514 


184 


C-methyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
ai, 1990). 



The orf22c gene encodes a protein exhibiting relatively strong similarity with the 
5 protein encoded by the JkbH gene of Streptomyces hygroscopicus var. ascomyceticus 
which encodes an enzyme involved in methoxymalonyl biosynthesis (K. Wu et aL, 
2000; GenBank accession number: AAF86387; BLAST score: 463). The similarity of 
the protein encoded by the orf22c gene with this protein involved in the biosynthetic 
pathway for another close macrolide strongly suggests that this gene also encodes an 
10 enzyme involved in methoxymalonyl biosynthesis in Streptomyces ambofaciens 
(cf. figure 8). 

The orf23c gene encodes a protein exhibiting relatively strong similarity with the 
protein encoded by the jkbl gene of Streptomyces hygroscopicus var. ascomyceticus 
which encodes an acyl-CoA dehydrogenase involved in methoxymalonyl (K. Wu, et aL, 
15 2000; GenBank accession number: AAF86388; BLAST score: 387). The similarity of 
the protein encoded by the orf23c gene with several acyl-CoA dehydrogenases involved 
in the biosynthetic pathway for other close antibiotics strongly suggests that this gene 
encodes an acyl-CoA dehydrogenase involved in methoxymalonyl biosynthesis 
(cf. figure 8). This hypothesis is supported by the fact that the protein encoded by the 
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orf23c gene exhibits strong similarity with other proteins of similar function in other 
organisms (cf. table 31). 

Table 31 



Protein exhibiting significant 
similarity 


GeaBank 
accession 
number 


BLAST* 
score 


Reported function 


Acyl-CoA dehydrogenase 
(Polyangium cellulosum) 


AAK19892 


171 


Acyl-CoA 
dehydrogenase 


Probable acyl-CoA dehydrogenase - 
{Streptomyces coelicolor) 


T36802 


160 


acyl-CoA 
dehydrogenase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 



5 a/., 1990). 

The orf24c gene encodes a protein exhibiting relatively strong similarity with the 
protein encoded by the fkbJ gene of Streptomyces hygroscopicus var. ascomyceticus 
which is thought to encode the acyl carrier protein (ACP) involved in methoxymalonyl 
biosynthesis (K. Wu et al. 9 2000; GenBank accession number: AAF86389; BLAST 
10 score: 87). The similarity of the protein encoded by the orf24c gene with this protein 
involved in the biosynthetic pathway for another close macrolide strongly suggests that 
this gene encodes a protein involved in methoxymalonyl biosynthesis in Streptomyces 
ambofaciens (cf. figure 8). 

The orf25c gene encodes a protein exhibiting a relatively strong similarity with the 
15 protein encoded by the fkbK gene of Streptomyces hygroscopicus var. ascomyceticus 
which encodes an acyl-CoA dehydrogenase involved in methoxymalonyl biosynthesis 
(K. Wu et al. 9 2000; GenBank accession number: AAF86390; BLAST score: 268). The 
similarity of the protein encoded by the orf25c gene with several acyl-CoA 
dehydrogenases involved in the biosynthetic pathway for other close antibiotics strongly 
20 suggests that this gene encodes an acyl-CoA dehydrogenase involved in 



64 



methoxymalonyl biosynthesis (cf. figure 8). This hypothesis is supported by the fact that 
the protein encoded by the orf25c gene exhibits strong similarity with other proteins of 
similar function in other organisms (cf. table 32). 



Table 32 



Protein exhibiting significant similarity 


GenBank 
number 


BLAST* 


Reported function 


Probable 3-Hydroxybutyryl-CoA 
dehydrogenase {Bacillus subtilis) 


P45856 


177 


3-Hydroxybutyryl- 
CoA dehydrogenase 


3-hydroxybutyryl-CoA dehydrogenase 
protein {Bacillus thuringiensis serovar 
kurstaki) 


AAL32270 


174 


3 -hydroxybutyryl - 
CoA dehydrogenase 


3-hydroxybutyryl-CoA dehydrogenase 
{Deinococcus radiodurans) 


NP_294792 


167 


3 -hydroxybutyryl - 
CoA dehydrogenase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). .J' 



The orf26 gene encodes a protein exhibiting 65% identity (determined using the 
BLAST program) with the protein encoded by the tylCV gene which encodes a 
mycarosyltransferase involved in tylosine biosynthesis in Streptomyces fradiae (N. Bate, 
et al. 9 2000; GenBank accession number: AAD41824, BLAST score: 471). More 
particularly, TylCV is a glycosyltransferase which binds the mycarose molecule during 
tylosine synthesis. This similarity with a protein involved in the biosynthetic pathway 
for another relatively close antibiotic, and more particularly in mycarose transfer, 
suggests that the orf26 gene is a glycosyltransferase. This hypothesis is supported by the 
fact that the protein encoded by the orf26 gene exhibits strong similarity with other 
proteins of similar function in other organisms (cf. table 33). 
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Table 33 



Protein exhibiting sipnifirant similarity/ 


vj CI LD dj IK. 

accession 
number 


score 


ixeponeo runction 


Glycosyltransferase 
{Streptomyces avermitilis) 


BAA84592 


218 


Glycosyltransferase 


CalG4 {Micromonospora echinospora) 


AAM70365 


217 


Glycosyltransferase 


CalG2 {Micromonospora echinospora) 


AAM70348 


197 


Glycosyltransferase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
aL, 1990). 



The orf27 gene encodes a protein exhibiting a 70% identity (determined using the 
BLAST program) with the protein encoded by the tylCVII gene which encodes an NDP- 
hexose 3,5- (or 5-) epimerase involved in tylosine biosynthesis in Streptomyces fradiae 
(N. Bate et al. 9 2000; GenBank accession number: AAD41825, BLAST score: 243). 
More particularly, TylCVII is a hexose 3,5- (or 5-) epimerase involved in mycarose 
biosynthesis. This similarity with a protein involved in the biosynthetic pathway for 
another relatively close antibiotic, and more particularly in mycarose biosynthesis, 
suggests that the or/27 gene encodes an epimerase. This hypothesis is supported by the 
fact that the protein encoded by the orf27 gene exhibits strong similarity with other 
proteins of similar function in other organisms (cf. table 34). Analysis of the close 
sequences obtained using the BLAST program strongly suggests that the or/27 gene 
encodes a 5-epimerase responsible for the epimerization reaction necessary for mycarose 
biosynthesis (cf. figure 4). 
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Table 34 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


LanZl (Streptomyces cyanogenns) 


AAD13558 


172 


NDP-hexose 3,5- 
epimerase 


Epi {Saccharopolyspora spinosa) 


AAK83288 


169 


TDP-4-keto-6- 
deoxyglucose 3,5- 
epimerase 


dNTP-hexose 3,5-epimerase 
(Amycolatopsis mediterranei) 


AAC01732 


166 


dNTP-hexose 3,5- 
epimerase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et 
al. y 1990). 



The sequence of orf28c was initially determined partially, since the sequence of a 
5 region of approximately 450 base pairs was only determined after resequencing (this 
region is symbolized by "N" in the incomplete sequence SEQ ID No. 106). The partial 
sequence of this ORF (SEQ ID No. Ill) was nevertheless used for the analysis with the 
various computer programs as explained above. It was thus possible to determine that 
the orf28c gene encodes a protein exhibiting 64% identity, over the determined sequence 

10 (SEQ ID No. 112, which is the partial sequence of the Orf28c protein) (determined using 
the BLAST program), with the protein encoded by the acyB2 gene which encodes a 
regulatory protein involved in carbomycin biosynthesis in Streptomyces thermotolerans 
(A. Arisawa et al, 1993; GenBank accession number: JC2032, BLAST score: 329). This 
similarity with a protein involved in the biosynthetic pathway for a relatively close 

15 antibiotic suggests that the orf28c gene encodes a regulatory protein involved in 
spiramycin biosynthesis. This hypothesis is supported by the fact that the protein 
encoded by the orf28c gene also exhibits strong similarity with the TylR protein, which 
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is a regulatory protein involved in tylosine biosynthesis in Streptomyces fradiae (N. Bate 
et aL, 1999; GenBank accession number: AAF29380, BLAST score: 167). 

It was possible to amplify the orf28c gene using oligonucleotides located on either 
side of the undetermined sequence and to subclone it into an expression vector. It was 
5 thus possible to demonstrate that overexpression of the orf28c gene significantly 
increases spiramycin production in the OSC2 strain (cf. example 24). This demonstrates 
that overexpresion of orf28c leads to an increase in spiramycin production and confirms 
its role as a regulator of the biosynthetic pathway for spiramycins. 

The partial sequence of orf28c was subsequently completed and the missing region 

10 of approximately 450 base pairs was determined (cf. SEQ ID No. 140 and SEQ ID 
No. 141). The complete sequence of this ORF (SEQ ID No. 141) was used for the 
analysis with the various computer programs as explained above. It was thus possible to 
determine that the orf28c gene encodes a protein exhibiting 69% identity over the 
sequence determined (SEQ ID No. 142, which is the complete sequence of the Orf28c 

15 protein) (determined using the BLAST program), with the protein encoded by the acyB2 
gene which encodes a regulatory protein involved in carbomycin biosynthesis in 
Streptomyces thermotolerans (Arisawa, A., et aL, 1993; GenBank accession number: 
JC2032, BLAST score: 451). This similarity with a protein involved in regulating the 
biosynthesis of a relatively close antibiotic suggests that the orf28c gene encodes a 

20 regulatory protein involved in spiramycin biosynthesis. This hypothesis is supported by 
the fact that the protein encoded by the orf28c gene also exhibits strong similarity with 
the TylR protein, which is a regulatory protein involved in regulating tylosine 
biosynthesis in Streptomyces fradiae (Bate, N. et aL, 1999; GenBank accession number: 
AAF29380, BLAST score: 224). The results of overexpression of this gene (cf. example 

25 24) confirm its role as a regulator of the spiramycin biosynthesis pathway. 

The orf28c gene was inactivated. It was thus possible to show that the resulting 
strain no longer produces spiramycins. This confirms that the orf28c gene is clearly 
involved in spiramycin biosynthesis and is essential to spiramycin biosynthesis. These 
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results, combined with the results of overexpression of this gene (cf. example 24), are in 
agreement with Orf28c having a role as an activator essential to spiramycin biosynthesis. 

The or/29 gene encodes a protein exhibiting 31% identity (determined using the 
BLAST program) with a probable glycosyl hydrolase located in the gene group involved 
in the biosynthesis of soraphen A (an antifungal agent of the polyketide class) in 
Sorangium cellulosum (J. Ligon et aL, 2002; GenBank accession number: AAK 19890, 
BLAST score: 139). This similarity with a protein involved in the biosynthetic pathway 
for a relatively close molecule suggests that the orf29 gene encodes a protein having 
glycosyl hydrolase activity. This hypothesis is supported by the fact that the protein 
encoded by the orf29 gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 35). Analysis of the sequence of the protein 
encoded by orf29 using the CD-search program (cf. above) also suggests that the or/29 
gene encodes a glycosyl hydrolase. 



Table 35 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


ManA (Caldicellulosiruptor 
saccharolyticus) 


AAC44232 


136 


beta- 1 ,4-mannanase 


ManA {Dictyoglomus thermophilum) 


AAB82454 


129 


beta-mannanase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et aL, 
1990). 



Analysis of the protein sequence deduced from or/29, using the SignalP program 
(http://www.cbs.dtu.dk/services/SignalP/) (Nielsen, H., et aL, 1997), shows that this 
protein has a C-terminal signal sequence with a predicted cleavage site between 
positions 30 and 31 (QSA/QA). It may be predicted that this protein is extracellular. It 
might, as a glycosylhydrolase, have a role in the reactivation of spiramycin inactivated 
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by glycosylation by the glycosyltransferases GimA and/or GimB (Gourmelen et al, 
1998). 

The orfiOc gene encodes a protein exhibiting 31% identity (determined using the 
BLAST program) with an nucleoside-diphosphate sugar epimerase in Corynebacterium 
glutamicum (GenBank accession number: NP_600590, BLAST score: 89). This 
similarity suggests that the orfiOc gene encodes an epimerase. This hypothesis is 
supported by the fact that analysis of the sequence using the CD-search program 
(cf. above) also suggests that the orfiOc gene encodes an epimerase. 

orfiOc exhibits two possible initiation codons (cf. SEQ ID No. 115) which give 
two possible proteins of 345 and 282 amino acids respectively (SEQ ID No. 116 and 
117). However, the codon usage is typical of Streptomyces only from the second ATG; 
in addition, the deduced protein sequence of the sequence between the first ATG and the 
second does not align with the identified close sequences, whereas the shortest protein 
sequence (from the 2nd ATG: SEQ ID No. 144) aligns correctly with the beginning of 
these proteins. It can therefore be deduced from this that the second ATG is the correct 
initiation codon and that the sequence of this orf is therefore that presented in SEQ ID 
No. 143, which, once translated; corresponds to the protein of sequence SEQ ID 
No. 144. 

The orfi 1 gene encodes a protein exhibiting 52% identity (determined using the 
BLAST program) with an oxidoreductase in Streptomyces coelicolor (GenBank 
accession number: NP_631148, BLAST score: 261). This similarity suggests that the 
or/31 gene encodes a reductase. This hypothesis is supported by the fact that analysis of 
the sequence using the CD-search program (cf. above) also suggests that the orfi 1 gene 
encodes a reductase. This hypothesis is also supported by the fact that the protein 
encoded by the orfi J gene exhibits strong similarity with other proteins of similar 
function in other organisms (cf. table 36). 
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Table 36 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Putative oxidoreductase 
{Streptomyces griseus) 


BAB79295 


173 


Oxidoreductase 


MocA (Xanthomonas axonopodis) 


NP_640644 


171 


Oxidoreductase 



* greater sequence similarity is associated with a higher BLAST score (Altschul et al. 9 
1990). 



The or/3 J gene was inactivated. It was thus possible to show that the resulting 
strain no longer produces spiramycins. This confirms that the orfil gene is indeed 
involved in spiramycin biosynthesis. The enzyme encoded by this gene is therefore 
clearly responsible for a bioconversion step essential to spiramycin biosynthesis. 

The sequence of orf32c was first of all determined partially (cf. example 19), since 
the coding sequence in 5' position was only determined in a second step. The partial 
sequence of this orf (SEQ ID No. 120) was, however, used for the analysis with the 
various computer programs as explained above. It was thus possible to determine that 
the or/3 2c gene encodes a protein exhibiting 47% identity, over the determined sequence 
(SEQ ID No. 121, which is the partial sequence of the Orf32c protein) (determined using 
the BLAST program), with a regulatory protein of the GntR family in Streptomyces 
coelicolor (GenBank accession number: NP 625576, BLAST score: 229). This 
similarity suggests that the or/32c gene encodes a transcriptional regulator of the GntR 
family. This hypothesis is supported by the fact that the protein encoded by the orJ32c 
gene exhibits strong similarity with other proteins of similar function in other organisms. 

The partial sequence of or/32c was subsequently completed and the missing region 
was determined (cf. SEQ ID No. 140 and SEQ ID No. 145). The complete sequence of 
this orf encodes a protein exhibiting 44% identity (determined using the BLAST 
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program) with a regulatory protein of the GntR family in Streptomyces avermitilis 
(GenBank accession number: NP 824604, BLAST score: 282). This similarity suggests 
that the orJ32c gene encodes a transcriptional regulator of the GntR family. This 
hypothesis is supported by the fact that the protein encoded by the orf32c gene exhibits 
strong similarity with other proteins of similar function in other organisms (cf. table 37). 



Table 37 



Protein exhibiting significant similarity 


GenBank 
accession 
number 


BLAST* 
score 


Reported function 


Regulatory protein of the GntR family 
(Streptomyces avermitilis) 


NP_828241 


270 


Regulatory protein of 
the GntR family 


Regulatory protein of the GntR family 
(Streptomyces coelicolor) 


NP_625576 


266 


Regulatory protein of 
the GntR family 


SC5G8.04 (Streptomyces coelicolor) 


NP_628991 


258 


Regulatory protein of 
the GntR family 


Transcriptional regulator 
(Streptomyces venezuelae) 


AAF01064 


224 


Transcriptional 
regulator 


Regulatory protein of the GntR family 
(Streptomyces avermitilis) 


NP_827432 


239 


Regulatory protein of 
the GntR family 



* greater sequence similarity is associated with a higher BLAST score (Altschul et al. 9 
1990). 



The orf32c gene was inactivated with the aim of studying the function of this gene 
in the spiramycin biosynthesis pathway in Streptomyces ambofaciens. It was possible to 
show that the resulting strain produces spiramycins. This indicates that the orf32c gene 
is not essential to spiramycin biosynthesis and that it is not essential to survival of the 
bacterium. 
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The orJ33 gene encodes a protein exhibiting 49% identity (determined using the 
BLAST program) with a hypothetical protein of Xanthomonas campestris (GenBank 
accession number: NP 635564, BLAST score: 54). 

The sequence of orf34c is partial. In fact, the comparisons carried out between the 
product of this orf and the databases suggest that the C-terminal portion of this protein is 
not in the product deduced from the nucleotide sequence and therefore that this orf is 
longer and continues beyond the region sequenced. The partial sequence of this ORF 
was, however, used for the analysis with the various computer programs as explained 
above. It was possible to determine that the orf34c gene encodes a protein exhibiting 
91% identity, over the sequence determined (SEQ ID No. 150, which is the partial 
sequence of the Orf34c protein) (determined using the BLAST program), with an 
arabinofuranosidase from Streptomyces coelicolor (Bentley et al, 2002; GenBank 
accession number: NP_630049, BLAST score: 654). In S. coelicolor, the gene encoding 
this arabinofuranosidase does not appear to be involved in secondary metabolite 
biosynthesis. In S. ambofaciens, this gene is therefore probably not involved in 
spiramycin biosynthesis. 

A subject of the present invention is also polynucleotides which hybridize, under 
high stringency hybridization conditions, to at least one of the polynucleotides of 
sequence SEQ ID Nos 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 28, 30, 34, 36, 40, 43, 
45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 1 1 1, 1 13, 1 15, 
118, 120, 141, 143, 145, 147 and 149, or one of its variants, or one of the sequences 
derived therefrom due to the degeneracy of the genetic code. Preferentially, these said 
polynucleotides are isolated from a bacterium of the genus Streptomyces, more 
preferentially these polynucleotides encode proteins involved in the biosynthesis of a 
macrolide, and even more preferentially these polynucleotides encode a protein having 
activity similar to the protein encoded by the polynucleotides with which they 
hybridize. The high stringency hybridization conditions can be defined as hybridization 
conditions which are not favorable to the hybridization of nonhomologous nucleic acid 
strands. High stringency hybridization conditions may, for example, be described as 
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hybridization conditions in the buffer described by Church & Gilbert (Church & 
Gilbert, 1984) at a temperature of between 55°C and 65°C; the hybridization 
temperature is preferably 55°C, the hybridization temperature is even more preferably 
60°C, and the hybridization temperature is most preferably 65°C, followed by one or 
more washings carried out in 2X SSC buffer at a temperature of between 55°C and 
65°C; this temperature is preferably 55°C, this temperature is even more preferably 
60°C and this temperature is most preferably 65°C, followed by one or more washes in 
0.5X SSC buffer at a temperature of between 55°C and 65°C; this temperature is 
preferably 55°C, this temperature is even more preferably 60°C and this temperature is 
most preferably 65°C. The hybridization conditions described above can be adjusted as 
a function of the length of the nucleic acid the hybridization of which is sought, or of 
the type of labeling chosen, according to techniques known to those skilled in the art. 
Suitable hybridization conditions can, for example, be adjusted according to the work 
by F. Ausubel et al, 2002. 

The invention also relates to a polynucleotide having at least 70%, more 
preferably 80%, more preferably 85%, even more preferably 90%, even more preferably 
95%, and most preferably 98%, nucleotide identity with a polynucleotide comprising at 
least 10, 12, 15, 18, 20 to 25, 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 
400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1100, 1150, 
1200, 1250, 1300, 1350, 1400, 1450, 1500, 1550, 1600, 1650, 1700, 1750, 1800, 1850 
or 1900 consecutive nucleotides of a polynucleotide chosen from the group consisting 
of the nucleotide sequences SEQ ID Nos 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 28, 
30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 
109, 111, 113, 115, 118, 120, 141, 143, 145, 147 and 149, or one of its variants, or one 
of the sequences derived therefrom due to the degeneracy of the genetic code, or a 
polynucleotide of complementary sequence. Preferentially, these said polynucleotides 
are isolated from a bactrium of the genus Streptomyces, more preferentially these 
polynucleotides encode proteins involved in the biosynthesis of a macrolide, and even 
more preferentially these polynucleotides encode proteins having activity similar to the 
proteins encoded by the polynucleotides with which they exhibit the identity. Most 
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preferably, a polynucleotide according to the invention is chosen from the group 
consisting of the nucleotide sequences SEQ ID Nos 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 

25, 28, 30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 
84, 107, 109, 111, 113, 115, 118, 120, 141, 143, 145, 147 and 149, or a polynucleotide 
of complementary sequence. 

The optimal alignment of the sequences for the comparison can be carried out on 
a computer using known algorithms, for example those of the FASTA package 
(W.R. Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 1990), accessible in particular 
from the INFOBIOGEN resource center, Evry, France. By way of illustration, the 
percentage sequence identity may be determined using the LFASTA (K.-M. Chao et al. 9 
x 1992) or LALIGN (X. Huang and W. Miller, 1991) software. The LFASTA and 
LALIGN programs are part of the FASTA package. LALIGN provides optimal local 
alignments; this program is more rigorous, but also slower than LFASTA. 

Another aspect of the invention concerns a polypeptide resulting from the 
expression of a nucleic acid sequence as defined above. Preferentially, the polypeptides 
according to the invention exhibit at least 70%, more preferably 80%, more preferably 
85%, even more preferably 90%, even more preferably 95% and most preferably 98%, 
amino acid identity with a polypeptide comprising at least 10, 15, 20, 30 to 40, 50, 60, 
70, 80, 90, 100, 120, 140, 160, 180, 200, 220, 240, 260, 280, 300, 320, 340, 360, 380, 
400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 600, 620 or 640 consecutive amino 
acids of a polypeptide chosen from SEQ ID Nos 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 

26, 27, 29, 31, 32, 33, 35, 37, 38, 39, 41, 42, 44, 46, 48, 50, 51, 52, 54, 55, 56, 57, 58, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 108, 110, 112, 114, 116, 117, 119, 
121, 142, 144, 146, 148 and 150, or one of these sequences except that, all along said 
sequence, one or more amino acids have been substituted, inserted or deleted without 
affecting the functional properties thereof, or one of the variants of these sequences. 
Preferentially, the polypeptides according to the invention are expressed in a natural 
state by a bacterium of the genus Streptomyces, more preferentially these polypeptides 
are involved in the biosynthesis of a macrolide, and even more preferentially these 
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polypeptides have activity similar to that of the polypeptide with which they share the 
identity. Preferably, a polypeptide according to the invention is chosen from the group 
consisting of the polypeptide sequences SEQ ID Nos 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 27, 29, 31, 32, 33, 35, 37, 38, 39, 41, 42, 44, 46, 48, 50, 51, 52, 54, 55, 56, 57, 
58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 108, 110, 112, 114, 116, 117, 
119, 121, 142, 144, 146, 148 and 150, or one of these sequences except that, all along 
said sequence, one or more amino acids have been substituted, inserted or deleted 
without affecting the functional properties thereof, or one of the variants of these 
sequences. Most preferably, a polypeptide according to the invention is chosen from the 
group consisting of the polypeptide sequences SEQ ID Nos 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 27, 29, 31, 32, 33, 35, 37, 38, 39, 41, 42, 44, 46, 48, 50, 51, 52, 54, 55, 
56, 57, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 108, 110, 1 12, 1 14, 116, 
117, 119, 121, 142, 144, 146, 148 and 150. 

The optimal alignment of the sequences for the comparison can be carried out on 
a computer using known algorithms, for example those of the FASTA package 
(W.R. Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 1990), accessible in particular 
from the INFOBIOGEN resource center, Evry, France. By way of illustration, the 
percentage sequence identity may be determined using the LFASTA (K.-M. Chao et al. 9 
1992) or LALIGN (X. Huang and W. Miller, 1991) software using the default 
parameters as defined by the INFOBIOGEN resource center, Evry, France. The 
LFASTA and LALIGN programs are part of the FASTA package. LALIGN provides 
optimal local alignments; this program is more rigorous, but also slower than LFASTA. 

Another aspect of the invention concerns a recombinant DNA comprising at least 
one polynucleotide as described above. Preferentially, this recombinant DNA is a 
vector. Even more preferentially, the vector is chosen from bacteriophages, plasmids, 
phagemids, integrative vectors, fosmids, cosmids, shuttle vectors, BACs (Bacterial 
Artificial Chromosomes) and PACs (PI -derived Artificial Chromosomes). By way of 
illustration, the lambda phage and the Ml 3 phage may be mentioned as bacteriophages. 
As plasmids, mention may be made of plasmids which replicate in E. coli, for example 
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pBR322 and its derivatives, pUC18 and its derivatives, pUC19 and its derivatives, 
pGB2 and its derivatives (G. Churchward et al, 1984), pACYC177 (GenBank 
accession number: X06402) and its derivatives, and pACYC184 (GenBank accession 
number: X06403) and its derivatives. Mention may also be made of plasmids which 
5 replicate in Streptomyces, such as, for example, pIJlOl and its derivatives, pSG5 and its 
derivatives, SLP1 and its derivatives, and SCP2* and its derivatives (Kieser et al 
2000). As phagemids, mention may be made, by way of illustration, of pBluescript II 
and its derivatives (marketed in particular by the company Stratagene (LaJolla, 
California, USA)), pGEM-T and its derivatives (marketed by the company Promega 

10 (Madison, Wisconsin, USA)), [lacuna] the IS117 integration system (Kieser et al, 
2000). As fosmids, mention may be made, by way of illustration, of the fosmid pFOSl 
(marketed by the company New England Bioloabs Inc., Beverly, Massachusetts, USA) 
and its derivatives. As cosmids, mention may be made, by way of illustration, of the 
cosmid SuperCos and its derivatives (marketed in particular by the company Stratagene 

15 (LaJolla, California, USA)) and the cosmid pWED15 (Wahl et al, 1987) and its 
derivatives. As shuttle vectors, mention may be made, by way of illustration, of 
E. colil Streptomyces shuttle plasmids, such as, for example, pIJ903 and its derivatives, 
the series of plasmids pUWL, pCAO106, pWHM3 and pOJ446 and their derivatives 
(Kieser et al 2000), and E. colil Streptomyces shuttle BACs, such as, for example, those 

20 described in patent application WO 01/40497. As BACs (Bacterial Artificial 
Chromosomes), mention may be made, by way of illustration, of the BAC pBeloBACl 1 
(GenBank accession number: U51 113). As PACs (PI -derived Artificial Chromosomes), 
mention may be made, by way of illustration, of the vector pCYPAC6 (GenBank 
accession number: AF 133437). Most preferably, a vector according to the invention is 

25 chosen from pOS49.1, pOS49.11, pOSC49.12, pOS49.14, pOS49.16, pOS49.28, 
pOS44.1, pOS44.2, pOS44.4, pSPM5, pSPM7, pOS49.67, pOS49.88, pOS49.106, 
pOS49.120, pOS49.107, pOS49.32, pOS49.43, pOS49.44, pOS49.50, pOS49.99, 
pSPM17, pSPM21, pSPM502, pSPM504, pSPM507, pSPM508, pSPM509, pSPMl, 
pBXLllll, pBXL1112, pBXL1113, pSPM520, pSPM521, pSPM522, pSPM523, 

30 pSPM524, pSPM525, pSPM527, pSPM528, pSPM34, pSPM35, pSPM36, pSPM37, 
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pSPM38, pSPM39, pSPM40, pSPM41, pSPM42, pSPM43, pSPM44, pSPM45, 

pSPM47, pSPM48, pSPM50, pSPM51, pSPM52, pSPM53 5 pSPM55, pSPM56, 

pSPM58, pSPM72, pSPM73, pSPM515, pSPM519, pSPM74, pSPM75, pSPM79, 
pSPM83, pSPM107, pSPM543 and pSPM106. 

Another aspect of the invention concerns an expression system comprising a 
suitable expression vector and a host cell allowing the expression of one or more 
polypeptides as described above in a biological system. The expression vectors 
according to the invention comprise a nucleic acid sequence encoding one or more 
polypeptides as described above, and may be intended for the expression of the various 
polypeptides according to the invention in diverse host cells well known to those skilled 
in the art. By way of example, mention may be made of prokaryotic expression systems, 
such as the expression system in the bacterium E. coli, and eukaryotic expression 
systems, such as the baculovirus expression system which allows expression in insect 
cells, and the expression systems which allow expression in yeast cells, or the 
expression systems which allow expression in mammalian cells, in particular human 
cells. The expression vectors which can be used in such systems are well known to 
those skilled in the art; as regards prokayrotic cells, mention may be made, by way of 
illustration, of the expression vectors in E. coli, for example of the pET marketed by the 
company Stratagene (LaJolla, California, USA), the vectors of the GATEWAY family 
marketed by the company Invitrogen (Carlsbad, California, USA), the vectors of the 
pBAD family marketed by the company Invitrogen (Carlsbad, California, USA), the 
vectors of the pMAL family marketed by the company New England Bioloabs Inc. 
(Beverly, Massachusetts, USA), and the rhamnose-inducible expression vectors 
mentioned in the publication B. Wilms et al., 2001 and their derivatives; mention may 
also be made of the expression vectors in Streptomyces, such as, for example, the 
vectors pIJ4123, pIJ6021, pPM927, pANT849, pANT 850, pANT 851, pANT1200, 
pANT1201 and pANT1202 and their derivatives (Kieser et al., 2000). As regards yeast 
cells, mention may be made, by way of illustration, of the vector pESC marketed by the 
company Stratagene (LaJolla, California, USA). As regards the baculovirus expression 
system which allows expression in insect cells, mention may be made, by way of 
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illustration, of the vector BacPAK6 marketed by the company BD Biosciences 
Clontech, (Palo Alto, California, USA). As regards mammalian cells, mention may be 
made, by way of illustration, of the vectors comprising the CMV (Cytomegalovirus) 
immediate early gene promoter (for example the vector pCMV and its derivatives 
marketed by the company Stratagene (LaJolla, California, USA)), or the SV40 early 
promoter of the vacuolating simian virus (for example the vector pSG5 marketed by the 
company Stratagene (LaJolla, California, USA). 

The invention also relates to a method of producing a polypeptide as described 
above, said method comprising the following steps: 

a) inserting a nucleic acid encoding said polypeptide into a suitable vector; 

b) culturing, in a suitable culture medium, a host cell transformed or transfected 
beforehand with the vector step a); 

c) recovering the conditioned culture medium or a cell extract, for example by 
sonication or by osmotic shock; 

d) separating and purifying said polypeptide from said culture medium or else 
from the cell extract obtained in step c); 

e) where appropriate, characterizing the recombinant polypeptide produced. 

A recombinant polypeptide according to the invention can be purified by passage 
over an appropriate series of chromatography columns, according to the methods known 
to those skilled in the art and described, for example, in F. Ausubel et al 9 (2002). By 
way of illustration, mention may be made of the "Histidine-Tag" technique, which 
consists in adding a short polyhistidine sequence to the polypeptide to be produced, it 
being possible for this polypeptide to be purified on a nickel column. A polypeptide 
according to the invention may also be prepared by in vitro synthesis techniques. By 
way of illustration of such techniques, a polypeptide according to the invention may be 
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prepared using the "rapid translation system (RTS)'\ marketed in particular by the 
company Roche Diagnostics France S.A., Meylan, France. 

Another aspect of the invention concerns host cells into which has been 
introduced at least one polynucleotide and/or at least one recombinant DNA and/or at 
least one expression vector according to the invention. 

Another aspect of the invention concerns microorganisms blocked in a step of the 
biosynthetic pathway for at least one macrolide. The advantage lies, firstly, in studying 
the function of the mutated proteins and, secondly, in producing microorganisms which 
produce biosynthesis intermediates. These intermediates can be modified, optionally 
after separation, either by adding particular components to the production media, or by 
introducing into the microorganisms thus mutated other genes encoding proteins 
capable of modifying the intermediate by using it as a substrate. These intermediates 
can thus be modified chemically, biochemically, enzymatically and/or 
microbiologically. The microorganisms blocked in a step of the biosynthetic pathway 
for macrolides can be obtained by inactivating the function of one or more proteins 
involved in the biosynthesis of this or these macrolide(s) in microorganisms which 
produce this or these macrolide(s). Depending on the protein(s) inactivated, 
microorganisms blocked in the various steps of the biosynthetic pathway for this or 
these macrolide(s) may thus be obtained. The inactivation of this or these protein(s) can 
be carried out by any method known to those skilled in the art, for example by 
mutagenesis in the gene(s) encoding said protein(s) or by the expression of one or more 
antisense RNA(s) complementary to the messenger RNA(s) encoding said protein(s). 
The mutagenesis can, for example, be carried out by irradiation, by action of a 
mutagenic chemical agent, by solid-directed mutagenesis, by gene replacement, or any 
other method known to those skilled in the art. The conditions suitable for such a 
mutagenesis can, for example, be adjusted according to the teaching contained in the 
works by T. Kieser et al, (2000) and Ausubel et al. 9 (2002). The mutagenesis can be 
carried out in vitro or in situ, by suppression, substitution, deletion and/or addition of 
one or more bases in the gene under consideration, or by gene inactivation. This 
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mutagenesis can be carried out in a gene comprising a sequence as described above. 
Preferentially, the microorganisms blocked in a step of the biosynthetic pathway for 
macrolides are bacteria of the genus Streptomyces. More preferentially, the inactivation 
of the function of one or more proteins involved in the biosynthesis of the macrolide(s) 
in question is carried out by mutagenesis. Even more preferentially, the macrolide in 
question is spiramycin and the microorganisms in which the mutagenesis or 
mutageneses is (are) carried out are strains of 5. ambofaciens. More preferentially, the 
mutagenesis is carried out in one or more genes comprising one of the sequences 
corresponding to one or more of the sequences SEQ ID Nos 3, 5, 7, 9, 1 1, 13, 15, 17, 
19, 21, 23, 25, 28, 30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 107, 109, 111, 113, 115, 118, 120, 141, 143, 145, 147 and 149. 
Preferably, the mutagenesis or mutageneses is (are) carried out by gene inactivation. 
Most preferably, the mutagenesis consists of gene inactivation of a gene comprising a 
sequence corresponding to the sequence SEQ ID No. 13. 

By way of illustration, the following microorganisms may be mentioned as 
examples of such microorganisms: OS49.16 (or/3::. Qhyg, cf. example 2), OS49.67 
(or/3 in-phase deletion, cf example <5), OS49.107 (orfSy. Ohyg, cf. example 7), OS49.50 
(or/10:: Hhyg, cf. example 8), SPM21 (prf2::att3Qaac-, cf. example 10), SPM22 
(prf2::att3 in-phase deletion, cf. example 10), SPM501 (orf6*::attinhyg+, cf. example 
14), SPM502 (orf6*::attJ in-phase deletion, cf. example 14), SPM507 
(prfl2::att3Qaac-, cf. example 11), SPM508 (prfl3c::att3Qaac-, cf. example 12), and 
SPM509 (orfl4::att3Qaac-, cf. example 13), SPM107 (orf28c::att3aac+, cf. example 
29), SPM543 (orj31::att3aac+, cf. example 30), SPM106 (or/32c::att3aac+, 
cf. example 31). 

Another aspect of the invention concerns a method of preparing a macrolide 
biosynthesis intermediate, using the microorganisms blocked in a step of the 
biosynthetic pathway for macrolides, as described above. The method consists in 
culturing, in a suitable culture medium, a microorganism blocked in a step of the 
biosynthetic pathway for macrolides, as described above, recovering the conditioned 
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culture medium or a cell extract, for example by sonification or by osmotic shock, and 
separating and purifying said biosynthesis intermediate from said culture medium or 
else from the cell extract obtained in the preceding step. The conditions for culturing 
such microorganisms may be determined according to techniques well known to those 
skilled in the art. The culture medium may, for example, be the MP5 medium or the 
SL11 medium for Streptomyces, and in particular for Streptomyces ambofaciens 
(Pemodet et al, 1993). Those skilled in the art may in particular refer to the work by 
Kieser et al., (2000) as regards the culturing of Streptomyces. The intermediate 
produced can be recovered by any technique known to those skilled in the art. Those 
skilled in the art may, for example, refer to the techniques taught in US patent 
3,000,785, and more particularly to the methods for extracting spiramycins described in 
that patent. 

Another subject of the invention concerns a method of preparing a molecule 
derived from a macrolide, using the microorganisms blocked in a step of the 
biosynthetic pathway for this macrolide, as described above. The method consists in 
obtaining a biosynthesis intermediate according to the method above and in modifying 
the intermediate thus produced, optionally after separation from the culture medium. 
The conditions for culturing such microorganisms may be determined according to 
techniques well known to those skilled in the art. The culture medium may, for 
example, be the MPS medium or the SL1 1 medium for Streptomyces, and in particular 
for Streptomyces ambofaciens (Pemodet et al., 1993). Those skilled in the art may in 
particular refer to the work by Kieser et al, 2000 with regard to the culturing of 
Streptomyces. The intermediates produced may be modified, optionally after separation, 
either by adding suitable components to the production media, or by introducing into 
the microorganisms other genes encoding proteins capable of modifying the 
intermediate by using it as a substrate. These intermediates can thus be modified 
chemically, biochemically, enzymatically and/or microbiologically. More preferentially, 
the macrolide in question is spiramycin and the microorganisms in which the 
mutagenesis or mutageneses is (are) carried out are strains of S. ambofaciens. 



82 



The invention also relates to a microorganism which produces spiramycin I but 
which does not produce spiramycin II and III. This microorganism comprises all of the 
genes necessary for the biosynthesis of spiramycin I but does not produce spiramycin II 
and III since the gene comprising the sequence corresponding to SEQ ID No. 13, or one 
of its variants, or one of the sequences derived therefrom due to the degeneracy of the 
genetic code, and encoding a polypeptide of sequence SEQ ID No. 14, or one of its 
variants, is not expressed or has been made inactive. The inactivation of this protein can 
be carried out by any method known to those skilled in the art, for example by 
mutagenesis in the gene encoding said protein or by the expression of antisense RNA 
complementary to the messenger RNA encoding said protein, it being understood that, 
if the expression of orp* is affected by this manipulation, it will be necessary to 
perform another modification so that the orf5* gene is correctly expressed. The 
mutagenesis may be carried out in the coding sequence or in a noncoding sequence so 
as to make the encoded protein inactive or to prevent its expression or its translation 
therefrom. The mutagenesis may be carried out by site-directed mutagenesis, by gene 
replacement or any other method known to those skilled in the art. The conditions 
suitable for such a mutagenesis may, for example, be adjusted according to the teaching 
contained in the works by T. Kieser et al, (2000) and Ausubel et ai, 2002. The 
mutagenesis may be carried out in vitro or in situ, by suppression, substitution, deletion 
and/or addition of one or more bases in the gene under consideration or by gene 
inactivation. The microorganism may also be obtained by expressing the genes of the 
biosynthetic pathway for spiramycin without them comprising the gene comprising the 
sequence SEQ ID No. 13, or one of its variants, or one of the sequences derived 
therefrom due to the degeneracy of the genetic code, and encoding a polypeptide of 
sequence SEQ ID No. 14, or one of its variants. Preferentially, the microorganism is a 
bacterium of the genus Streptomyces. More preferentially, the microorganism which 
produces spiramycin I but which does not produce spiramycin II and III is obtained 
from a starting microorganism which produces spiramycins I, II and III. Even more 
preferentially, the microorganism is obtained by mutagenesis in a gene comprising the 
sequence corresponding to SEQ ID No. 13, or one of its variants, or one of the 
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sequences derived therefrom due to the degeneracy of the genetic code, and encoding a 
polypeptide of sequence SEQ ID No. 14, or one of its variants having the same 
function. Even more preferentially, this mutagenesis is carried out by gene inactivation. 
Preferably, the microorganism is obtained from a strain of S. ambofaciens which 
produces spiramycins I, II and III, in which gene inactivation of the gene comprising the 
sequence corresponding to SEQ ID No. 13, or one of the sequences derived therefrom 
due to the degeneracy of the genetic code, is carried out. Most preferably, the gene 
inactivation is carried out by in-phase deletion of the gene or of a part of the gene 
comprising the sequence corresponding to SEQ ID No. 13, or one of the sequences 
derived therefrom due to the degeneracy of the genetic code. By way of illustration, the 
strain SPM502 {prf6*::attl, cf. example 14) may be mentioned as a microorganism 
which produces spiramycin I but which does not produce spiramycin II and III. 

The invention also relates to a microorganism which produces spiramycin I but 
which does not produce spiramycin II and III, as described above, which also 
overexpresses: 

- a gene which can be obtained by polymerase chain reaction using the pair of primers of 
the following sequence: 5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ 
ID No. 138) and 5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID 
No. 139), and as matrix the cosmid pSPM36 or the total DNA of Streptomyces 
ambofaciens, more preferably it is the gene of coding sequence SEQ ID No. 141, 

- or a gene derived therefrom due to the degeneracy of the genetic code. 

An example of sequence of such a gene is given in SEQ ID No. 1 1 1 (DNA); however, 
this sequence is partial since it does not comprise the 3' portion of the corresponding 
coding sequence. The translation of this portion of coding sequence into protein is given 
in SEQ ID No. 112. Those skilled in the art will easily be able to complete it in 
particular using the teaching given in example 24. The sequence undetermined in SEQ 
ID No. Ill was determined in a second step, and the complete sequence of this orf 
(orf28c) is given in SEQ ID No. 141. The translation into protein of this coding 
sequence is given in SEQ ID No. 142. Example 24 gives a method for cloning the orf28c 
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gene and for producing an expression vector which allows the expression of orf28c. This 
example also shows that overexpression of the orf28c gene in the strain OSC2 leads to 
the improvement of spiramycin production in this strain. The overexpression of the 
orf28c gene can be obtained by increasing the number of copies of this gene and/or by 
introducing a promoter which is more active than the wild-type promoter. Preferably, the 
overexpression of said gene is obtained by introducing into the microorganism a 
recombinant DNA construct which allows overexpression of this gene. Preferably, this 
recombinant DNA construct increases the number of copies of said gene and makes it 
possible to obtain overexpression of said gene. In this recombinant DNA construct, the 
coding sequence of the gene can be placed under the control of a promoter which is 
more active than the wild-type promoter. By way of illustration, mention may be made 
of the ptrc promoter which is active in Streptomyces ambofaciens (E. Amann et aL, 
1988) and also the ermE* promoter. Thus, preferably, the copy or copies of the orf28c 
gene introduced is (are) placed under the control of the ermE* promoter, as is the case in 
the construct pSPM75 (cf. example 24). 

The invention also relates to a microorganism which produces spiramycin I but 
which does not produce spiramycin II and III, as described above, which also 
overexpresses the gene having coding sequence SEQ ID No. 47 or having a coding 
sequence derived therefrom due to the degeneracy of the genetic code. Preferentially, 
this microorganism is the strain SPM502 pSPM525 deposited with the Collection 
Nationale de Cultures de Microorganismes [National Collection of Cultures and 
Microorganisms] (CNCM) Pasteur Institute, 25, rue du Docteur Roux 75724 Paris 
Cedex 15, France, on February 26, 2003, under the registration number 1-2977. 

The invention also relates to a method of producing spiramycin I; the method 
consists in culturing, in a suitable culture medium, a microorganism which produces 
spiramycin I but which does not produce spiramycin II and III, as described above, 
recovering the conditioned culture medium or a cell extract, and separating and 
purifying the spiramycin I from said culture medium or else from the cell extract 
obtained in the preceding step. The conditions for culturing such a microorganism may 
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be determined according to techniques well known to those skilled in the art. The 
culture medium may, for example, be the MP5 medium or the SL11 medium for 
Streptomyces, and in particular for Streptomyces ambofaciens (Pernodet et ai y 1993). 
Those skilled in the art may in particular refer to the work by Kieser et aL, 2000 
regarding the culturing of Streptomyces. The spiramycin I produced can be recovered by 
any techniques known to those skilled in the art. Those skilled in the art may, for 
example, refer to the techniques taught in US patent 3,000,785, and more particularly to 
the methods for extracting spiramycins described in that patent. 

Another aspect of the invention concerns the use of a nucleotide sequence 
according to the invention, for improving the macrolide production of a microorganism. 
Thus, the invention relates to a macrolide-producing mutant microorganism which has a 
genetic modification in at least one gene comprising a sequence as defined above, and/or 
which overexpresses at least one gene comprising a sequence as defined above. The 
genetic modification may consist of a suppression, a substitution, a deletion and/or an 
addition of one or more bases in the gene(s) under consideration with the aim of 
expressing one or more proteins having greater activity or of expressing a higher level of 
this or these protein(s). The overexpression of the gene under consideration can be 
obtained by increasing the number of copies of this gene and/or by introducing a 
promoter which is more active than the wild-type promoter. By way of illustration, 
mention may be made of the ptrc promoter which is active in Streptomyces ambofaciens 
(E. Amann et al. 9 1988) and also the ermE* promoter (Bibb et ai, 1985), (Bibb et aL, 
1994). Thus, the overexpression of the gene under consideration can be obtained by 
introducing, into a macrolide-producing microorganism under consideration, a 
recombinant DNA construct according to the invention which allows overexpression of 
this gene. Specifically, certain steps of macrolide biosynthesis are limiting and, if one or 
more proteins which are more active or a higher expression level of the wild-type 
protein(s) involved in these limiting steps are expressed, it is possible to improve the 
production of the macrolide(s) concerned. An increase in the rate of production of 
tylosine has, for example, been obtained in Streptomyces fradiae by duplication of the 
gene encoding a limiting methyltransferase which converts macrocine to tylosine 
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(R. Baltz, 1997). The production of expression of a protein which is more active can be 
obtained in particular by mutagenesis; those skilled in the art may, for example, refer in 
this respect to the work by F. Ausubel et al, (2002). Preferentially, these mutant 
microorganisms improved in terms of their macrolide production are bacteria of the 
genus Streptomyces. More preferentially, the macrolide in question is spiramycin and the 
microorganisms in which the mutagenesis or mutageneses is (are) carried out are strains 
of S. ambofaciens. More preferentially, the genetic modification is carried out in one or 
more genes comprising one of the sequences corresponding to one or more of the 
sequences SEQ ID Nos 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 28, 30, 34, 36, 40, 43, 
45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 107, 109, 1 1 1, 1 13, 1 15, 
118, 120, 141, 143, 145, 147 and 149, or one of its variants, or one of the sequences 
derived therefrom due to the degeneracy of the genetic code. Preferably, the 
microorganism overexpresses one or more genes comprising one of the sequences 
corresponding to one or more of the sequences SEQ ID Nos 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 
21, 23, 25, 28, 30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 
80, 82, 84, 107, 109, 111, 113, 115, 118, 120, 141, 143, 145, 147 and 149, or one of its 
variants, or one of the sequences derived therefrom due to the degeneracy of the genetic 
code. Preferably, the microorganism overexpresses the gene comprising a sequence 
corresponding to SEQ ID No. 1 1 1 or 1 4 1 , or one of its variants, or one of the sequences 
derived therefrom due to the degeneracy of the genetic code. The sequence given in SEQ 
ID No. Ill is partial; however, those skilled in the art will easily be able to complete it 
in particular using the teaching given in example 24. The sequence undetermined in 
SEQ ID No. 1 1 1 was determined in a second step, and the complete sequence of this orf 
(orf28c) is given in SEQ ID No. 141. The translation into protein of this coding 
sequence is given in SEQ ID No. 142. Example 24 gives a method for cloning the orf28c 
gene and for producing an expression vector which allows the expression oiorj28c. This 
example also shows that overexpression of the orf28c gene in the strain OSC2 leads to 
the improvement of spiramycin production in this strain. The overexpression of the 
orf28c gene can be obtained by increasing the number of copies of this gene and/or by 
introducing a promoter which is more active than the wild-type promoter. Preferably, the 
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overexpression of said gene is obtained by introducing into the microorganism a 
recombinant DNA construct which allows overexpression of this gene. Preferably, this 
recombinant DNA construct increases the number of copies of said gene and makes it 
possible to obtain overexpression of said gene. In this recombinant DNA construct, the 
coding sequence of the gene can be placed under the control of a promoter which is 
more active than the wild-type promoter. By way of illustration, mention may be made 
of the ptrc promoter which is active in Streptomyces ambofaciens (E. Amann et al, 
1988) and also the ermE* promoter. Thus, preferably, the copy or copies of the orf28c 
gene introduced is (are) placed under the control of the ermE* promoter, as is the case in 
the construct pSPM75 (cf. example 24). 

Another aspect of the invention concerns a method of producing macrolides, 
using the microorganisms described in the preceding paragraph. This method consists in 
culturing, in a suitable culture medium, a microorganism defined in the preceding 
paragraph, recovering the conditioned culture medium or a cell extract, and separating 
and purifying the macrolide(s) produced from said culture medium or else from the cell 
extract obtained in the preceding step. The conditions for culturing such 
microorganisms may be determined according to techniques well known to those skilled 
in the art. The culture medium rtiay, for example, be the MP5 medium or the SL11 
medium for Streptomyces, and in particular for Streptomyces ambofaciens (Pernodet et 
al. 9 1993). Those skilled in the art may in particular refer to the work by Kieser et al. 9 
2000 regarding the culturing of Streptomyces. The macrolide(s) produced can be 
recovered by any techniques known to those skilled in the art. Those skilled in the art 
may, for example, refer to the techniques taught in US patent 3,000,785, and more 
particularly to the methods for extracting spiramycins described in that patent. 
Preferentially, the microorganisms used in such a method are bacteria of the genus 
Streptomyces. More preferentially, the macrolide in question is spiramycin and the 
mutant microorganisms improved in terms of their spiramycin production are strains of 
S. ambofaciens. 



Another aspect of the invention concerns the use of a sequence and/or of a vector 
according to the invention, for preparing hybrid antibiotics. Specifically, the 
polynucleotides according to the invention may be used to obtain microorganisms 
expressing one or more mutant proteins giving rise to a modification in the substrate 
5 specificity, or else may be expressed in many antibiotic-producing microorganisms with 
the aim of generating hybrid antibiotics. Thus, the polynucleotides according to the 
invention may make it possible, by gene transfer between producer microorganisms, to 
produce hybrid antibiotics having advantageous pharmacological properties (Hopwood 
et al, 1985a, Hopwood et al, 1985b, Hutchinson et al., 1989). The principle by which 

10 genetic engineering can bring about the production of hybrid antibiotics was first of all 
proposed by Hopwood (Hopwood 1981). It was thus proposed that the enzymes 
involved in the biosynthesis of antibiotics often accept substrates which are structurally 
related but which differ from their natural substrate. It is generally accepted (Hopwood 
1981, Hutchinson 1988, Robinson 1988) that the enzymes encoded by the genes of the 

15 biosynthetic pathway for antibiotics have a less strict substrate specificity than the 
enzymes of primary metabolism. It has thus been possible to show that a large number 
of non-natural substrates are converted by antibiotic-producing microorganisms, their 
mutants or purified enzymes .of the biosynthetic pathway for these antibiotics 
(Hutchinson 1988). Using this teaching, those skilled in the art may construct 

20 microorganisms expressing one or more mutant proteins giving rise to a modification in 
the substrate specificity with the aim of generating hybrid antibiotics. 

The invention also relates to the use of at least one polynucleotide and/or at least 
one recombinant DNA and/or at least one expression vector and/or at least one 
polypeptide and/or at least one host cell according to the invention, for carrying out one 

25 or more byconversions. Thus, the invention makes it possible to construct bacterial or 
fungal strains in which one or more proteins according to the invention are expressed 
under the control of suitable expression signals. Such strains can then be used to carry 
out one or more byconversions. These byconversions may be carried out either using 
whole cells, or using acellular extracts of said cells. These byconversions may make it 

30 possible to convert a molecule into a derived form, with an enzyme of a biosynthetic 
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pathway. For example, Carreras et al describe the use of a strain of Saccharopolyspora 
erythraea and of Streptomyces coelicolor for producing new erythromycin derivatives 
(Carreras et al, 2002). Walczak et al describe the use of a Streptomyces P450 
monooxygenase for the bioconversion of desacetyladriamycin (an anthracycline analog) 
to novel anthracyclines (Walczak et al, 2001). Olonao et al describe the use of a 
modified strain of Streptomyces lividans for the bioconversion of epsilon- 
rhodomycinone to rhodomycin D (Olonao et al, 1999). Those skilled in the art may 
apply this principle to any biosynthesis intermediate. 

The invention also relates to a recombinant DNA which comprises: 

- a polynucleotide which can be obtained by polymerase chain reaction using the pair 
of primers of the following sequence: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 
5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) and as 
matrix the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, more 
preferably it is a polynucleotide of sequence SEQ ID No. 141, 

- or a fragment of at least 10, 12, 15, 18, 20 to 25, 30, 40, 50, 60, 70, 80, 90, 100, 
150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 
1000, 1050, 1100, 1150, 1200, 1250, 1300, 1350, 1400, 1450, 1460, 1470, 1480, 
1490 or 1500 consecutive nucleotides of the polynucleotide. 

Preferentially, this recombinant DNA is a vector. Even more preferentially, the 
vector is chosen from bacteriophages, plasmids, phagemids, integrative vectors, 
fosmids, cosmids, shuttle vectors, BACs (Bacterial Artificial Chromosomes) and 
PACs (PI -derived Artificial Chromosomes). By way of illustration, the lambda 
phage and the Ml 3 phage may be mentioned as bacteriophages. As plasmids, 
mention may be made of plasmids which replicate in E. coli, for example pBR322 
and its derivatives, pUC18 and its derivatives, pUC19 and its derivatives, pGB2 and 
its derivatives (G. Churchward et al, 1984), pACYC177 (GenBank accession 
number: X06402) and its derivatives, and pACYC184 (GenBank accession number: 
X06403) and its derivatives. Mention may also be made of plasmids which replicate 



in Streptomyces, such as, for example, pIJlOl and its derivatives, pSG5 and its 
derivatives, SLP1 and its derivatives, and SCP2* and its derivatives (Kieser et al 
2000). As phagemids, mention may be made, by way of illustration, of pBluescript II 
and its derivatives (marketed in particular by the company Stratagene (LaJolla, 
5 California, USA)), pGEM-T and its derivatives (marketed by the company Promega 
(Madison, Wisconsin, USA)), and XZAPII and its derivatives (marketed in particular 
by the company Stratagene (LaJolla, California, USA)). As integrative vectors, 
mention may be made, by way of illustration, of vectors which integrate in 
Streptomyces, such as, for example, those derived from SLP1 (Kieser et al 9 2000), 

10 those derived from pSAM2 (Kieser et al, 2000), vectors which use PhiC31 phage 
integration systems (Kieser et al, 2000) (for example pSET152 (Bierman et al, 
1992)) or VWB integration systems (L. van Mellaert et al 1998), and also vectors 
which use the IS117 integration system (Kieser et al, 2000). As fosmids, mention 
may be made, by way of illustration, of the fosmid pFOSl (marketed by the company 

15 New England Bioloabs Inc., Beverly, Massachusetts, USA) and its derivatives. As 
cosmids, mention may be made, by way of illustration, of the cosmid SuperCos and 
its derivatives (marketed in particular by the company Stratagene (LaJolla, 
California, USA)) and the cosmid pWED 15 (Wahl et al, 1987) and its derivatives. 
As shuttle vectors, mention may be made, by way of illustration, of 

20 E. colil Streptomyces shuttle plasmids, such as, for example, pIJ903 and its 
derivatives, the series of plasmids pUWL, pCAO106, pWHM3 and pOJ446 and their 
derivatives (Kieser et al 2000), and E. colil Streptomyces shuttle BACs, such as, for 
example, those described in patent application WO 01/40497. As BACs (Bacterial 
Artificial Chromosomes), mention may be made, by way of illustration, of the BAG 

25 pBeloBACll (GenBank accession number: U51113). As PACs (Pl-derived 
Artificial Chromosomes), mention may be made, by way of illustration, of the vector 
pCYPAC6 (GenBank accession number: AF1 33437). More preferentially, this 
recombinant DNA is an expression vector. The expression vectors which can be used 
in such systems are well known to those skilled in the art; as regards prokayrotic 

30 cells, mention may be made, by way of illustration, of the expression vectors in 
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E. coli, for example of the pET marketed by the company Stratagene (LaJolla, 
California, USA), the vectors of the GATEWAY family marketed by the company 
Invitrogen (Carlsbad, California, USA), the vectors of the pBAD family marketed by 
the company Invitrogen (Carlsbad, California, USA), the vectors of the pMAL family 
marketed by the company New England Bioloabs Inc. (Beverly, Massachussetts, 
USA), and the rhamnose-inducible expression vectors mentioned in the publication 
B.Wilms et al, 2001 and their derivatives; mention may also be made of the 
expression vectors in Streptomyces, such as, for example, the vectors pIJ4123, 
pIJ6021, pPM927, pANT849, pANT 850, pANT 851, pANT1200, pANT1201 and 
pANT1202 and their derivatives (Kieser et aL 9 2000). As regards yeast cells, mention 
may be made, by way of illustration, of the vector pESC marketed by the company 
Stratagene (LaJolla, California, USA). As regards the baculovirus expression system 
which allows expression in insect cells, mention may be made, by way of illustration, 
of the vector BacPAK6 marketed by the company BD Biosciences Clontech, (Palo 
Alto, California, USA). As regards mammalian cells, mention may be made, by way 
of illustration, of the vectors comprising the CMV (Cytomegalovirus) immediate 
early gene promoter (for example the vector pCMV and its derivatives marketed by 
the company Stratagene (LaJolla,/ California, USA)), or the SV40 early promoter of 
the vacuolating simian virus (for example the vector pSG5 marketed by the company 
Stratagene (LaJolla, California, USA)). Another aspect of the invention relates to 
host cells into which at least one recombinant DNA described in this paragraph has 
been introduced. 

Another aspect of the invention concerns a method of producing a 
polypeptide, wherein said method comprises the following steps: 

a) transforming a host cell with at least one expression vector as described in the 
paragraph above; 

b) culturing, in a suitable culture medium, said host cell; 

c) recovering the conditioned culture medium or a cell extract; 

d) separating and purifying said polypeptide from said culture medium or else 
from the cell extract obtained in step c); 
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e) where appropriate, characterizing the recombinant polypeptide produced. 

The recombinant polypeptide thus produced can be purified by passage over an 
appropriate series of chromatography columns, according to the methods known to 
those skilled in the art and described, for example, in F. Ausubel et al. y (2002). By way 
5 of illustration, mention may be made of the "Histidine-Tag" technique, which consists 
in adding a short polyhistidine sequence to the polypeptide to be produced, it being 
possible for this polypeptide to be purified on a nickel column. This polypeptide may 
also be prepared by in vitro synthesis techniques. By way of illustration of such 
techniques, the polypeptide may be prepared using the "rapid translation system 
10 (RTS)", marketed in particular by the company Roche Diagnostics France S.A., 
Meylan, France. 

Another aspect of the invention concerns a microorganism which produces at 
least one spiramycin, which overexpresses: 

- a gene which can be obtained by polymerase chain reaction (PCR) using the pair of 

15 primers of the following sequence: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 
5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139), and as 
matrix the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, more 
preferably it is the gene of coding sequence SEQ ID No. 141, 

20 - or a gene derived therefrom due to the degeneracy of the genetic code. 

An example of sequence of such a gene is given in SEQ ID No. 1 1 1 (DNA); however, 
this sequence is partial since it does not comprise the 3' portion of the corresponding 
coding sequence. The translation of this portion of coding sequence into protein is given 
in SEQ ID No. 112. Those skilled in the art will easily be able to complete it in 

25 particular using the teaching given in example 24. This example thus gives a method for 
cloning the orf28c gene and for producing an expression vector which allows the 
expression of orf28c. This example also shows that overexpression of the orf28c gene in 
the strain OSC2 leads to the improvement of spiramycin production in this strain. 
Preferably, the microorganism overexpressing 
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- a gene which can be obtained by polymerase chain reaction (PCR) using the pair of 
primers of the following sequence: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 
5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139), and as 
5 matrix the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, more 
preferably it is the gene of coding sequence SEQ ID No. 141 , 

- or a gene derived therefrom due to the degeneracy of the genetic code 

is a bacterium of the genus Streptomyces; even more preferably, it is a bacterium of the 
species Streptomyces ambofaciens. Preferably, the overexpression of said gene is 

10 obtained by transformation of said microorganism with an expression vector; most 
preferably, the strain of microorganism is the strain OSC2/pSPM75(l) or of the strain 
OSC2/pSPM75(2) deposited with the Collection Nationale de Cultures de 
Microorganismes (CNCM) [National Collection of Cultures and Microorganisms] 
Pasteur Institute, 25, rue du Docteur Roux 75724 Paris Cedex 15, France, on October 6, 

15 2003, under the registration number 1-3101 . 

Another aspect of the invention concerns a method of producing spiramycin(s), 
using the microorganisms described in the preceding paragraph. This method consists in 
culturing, in a suitable culture inedium, a microorganism defined in the preceding 
paragraph, recovering the conditioned culture medium or a cell extract, and separating 

20 and purifying the spiramycin(s) produced from said culture medium or else from the 
cell extract obtained in the preceding step. The conditions for culturing such 
microorganisms may be determined according to techniques well known to those skilled 
in the art. The culture medium may, for example, be the MP5 medium or the SL1 1 
medium for Streptomyces and in particular for Streptomyces ambofaciens (Pernodet et 

25 a/., 1993). Those skilled in the art may in particular refer to the work by Kieser et aL 9 
2000 regarding the culturing of Streptomyces. The spiramycin(s) produced can be 
recovered using any techniques known to those skilled in the art. Those skilled in the art 
may, for example, refer to the techniques taught in US patent 3,000,785, and more 
particularly to the methods for extracting spiramycins described in that patent. 
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Preferentially, the microorganisms used in such a method are bacteria of the genus 
Streptomyces. More preferentially, the microorganisms are strains of S. ambofaciens. 

Another aspect of the invention concerns an expression vector wherein the 
polynucleotide of sequence SEQ ID No. 47, or a polynucleotide derived therefrom 
5 due to the degeneracy of the genetic code, is placed under the control of a promoter 
which allows expression of the protein encoded by said polynucleotide in 
Streptomyces ambofaciens. Examples of expression vectors which can be used in 
Streptomyces have been given above. Preferentially, such an expression vector is the 
plasmid pSPM524 or pSPM525. 

10 Another aspect of the invention concerns a strain of Streptomyces 

ambofaciens transformed with a vector defined in the preceding paragraph. 

Another aspect of the invention concerns a polypeptide the sequence of which 
comprises the sequence SEQ ID No. 1 12. The invention also relates to a polypeptide 
the sequence of which corresponds to the sequence translated from the coding 
15 sequence: 

- of a gene which can be obtained by a polymerase chain reaction (PCR) using 
the pair of primers of the following sequence: 

5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) and 
5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) and as 
20 matrix the cosmid pSPM36 or the total DNA of Streptomyces ambofaciens, more 
preferably it is the gene of coding sequence SEQ ID No. 141 , 

- or of a gene derived therefrom due to the degeneracy of the genetic code. 
Preferentially, these polypeptides are expressed in the natural state by a bacterium of 
the genus Streptomyces, more preferentially these polypeptides are involved in 

25 spiramycin biosynthesis. 

Another aspect of the invention concerns an expression vector which allows 
expression of a polypeptide as defined in the preceding paragraph in Streptomyces 
ambofaciens. Examples of expression vectors which can be used in Streptomyces have 
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been given above. Preferentially, the expression vector in question is the plasmid 
pSPM75. 

LIST OF FIGURES 

Figure 1 : Chemical structure of spiramycins I, II and III. 
Figure 2: Cosmids used for sequencing the region. 

Figure 3: Organization of a group of genes involved in the biosynthetic pathway for 
spiramycins. 

Figure 4: Proposed biosynthetic pathway for mycarose. 
Figure 5: Proposed biosynthetic pathway for mycaminose. 
Figure 6: Proposed biosynthetic pathway for forosamine. 

Figure 7: Preferential order of the addition of sugars to the spiramycin molecule and 
intermediates. 

Figure 8: Proposed biosynthetic pathway for methoxymalonyl in S. ambofaciens. This 
pathway is proposed by analogy with the biosynthesis of methoxymalonyl in 
Streptomyces hygroscopicus var. ascomyceticus (K. Wu et aL, 2000). 
Figure 9: Steps leading to the inactivation of a gene: 

A) Cloning of the target gene into a vector which replicates E. coli but not in 
Streptomyces; 

B) Insertion of the resistance cassette into the target gene (by cloning or 
recombination between short identical sequences); 

C) Introduction of the plasmid into Streptomyces ambofaciens (by 
transformation or conjugation with E, coli) and selection of the clones having 
integrated the cassette then screening of clones having lost the vector portion so 
as to have a gene replacement; 

D) Region of the chromosome of the mutant strain in which the target gene is 
inactivated by gene replacement. 

Figure 10: Optional steps following the inactivation of a gene according to the method 
described in figure 9, which may be carried out if an excisable cassette has been used to 
interrupt the target gene; 
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E) Introduction into the mutant strain of the plasmid pOSV508 carrying the xis 
and int genes of pSAM2, the products of which will allow effective excision by 
site-specific recombination between the attL and attR sequences bordering the 
cassette; 

5 F) Production of clones which have lost the excisable cassette and which are 

sensitive to the antibiotic against which the cassette provided resistance; 
G) After growth and sporulation on solid medium without antibiotic, the 
plasmid pOSV508 is lost at high frequency. It is thus possible to obtain clones 
sensitive to thiostrepton in which the target gene contains an in-phase deletion. 
10 The sequence of the deleted target gene can be controlled by PCR amplification 

and sequencing of the PCR product. 

Figure 11: Amplification of the excisable cassette with the aim of using it for a 
homologous recombination experiment. The technique of homologous recombination 
via short homologous sequences has been described by Chaveroche et al., 2000. The 39 
15 or 40 deoxynucleotides located at the 5' end of these oligonucleotides comprise a 
sequence corresponding to a sequence of the gene to be inactivated, and the 20 
deoxynucleotides located in the most 3' position correspond to the sequence of one of 
the ends of the excisable cassette. . 

Figure 12: Production of a construct for the inactivation of a target gene using the 
20 technique described by Chaveroche et al. 9 2000. 

Figure 13: Map of the plasmid pWHM3. Strepto ori: Streptomyces origin of replication. 
Figure 14: Map of the plasmid pOSV508. 

Figure 15: Example of the structure of an excisable cassette. This consists of the Qhyg 
cassette (Blondelet-Rouault et aL, 1997) bordered by the attR and attL sites (Raynal et 
25 a/., 1998), between which a recombination event will allow the excision of the cassette, 
by virtue of the expression of the xis and int genes. 
Figure 16: Map of the plasmid pBXLl 111. 
Figure 17: Map of the plasmid pBXLl 1 12. 
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Figure 18: Microbiological test for spiramycin production, based on the sensitivity of a 
strain of Micrococcus luteus to spiramycin. The strain of Micrococcus luteus used is a 
strain naturally sensitive to spiramycin but resistant to congocidine. The various strains 
of Streptomyces to be tested were cultured in 500 ml Erlenmeyer flasks containing 70 ml 
of MP5 medium, inoculated at an initial concentration of 2.5 x 10 6 spores/ml and grown 
at 27°C with orbital shaking at 250 rpm. Samples of fermentation musts were taken after 
48, 72 and 96 hours of culturing, and centrifuged. A ten-fold dilution of these 
supernatants in sterile culture medium is used for the test. The Micrococcus luteus 
indicator strain resistant to congocidine but sensitive to spiramycin (Gourmelen et al, 
1998) was cultured in a square 12 x 12 cm dish. Disks of Whatman AA paper were 
soaked with 70 ul of the ten-fold dilution of each supernatant and placed on the surface 
of the dish. Disks soaked with a solution of spiramycin of various concentrations 
(2-4-8 ug/ml in MP5 culture medium) were diluted as a standard range. The dishes were 
incubated at 37°C for 24 to 48 h. If the disk contains spiramycin, this diffuses into the 
agar and inhibits the growth of the Micrococcus luteus indicator strain. This inhibition 
creates a "halo" around the disk, this halo reflecting the area where the Micrococcus 
luteus strain has not grown. The presence of this halo is therefore an indication of the 
presence of spiramycin and makes it possible to determine whether the strain of 
S. ambofaciens in question is or is not a spiramycin producer. Comparison with the 
inhibition diameters obtained for the standard range makes it possible to obtain an 
indication of the amount of spiramycin produced by this strain. 

Figure 19: HPLC chromatogram of the filtered culture medium supernatant of the strain 
OSC2. 

Figure 20: HPLC chromatogram of the filtered culture medium supernatant of the strain 
SPM501. 

Figure 21: HPLC chromatogram of the filtered culture medium supernatant of the strain 
SPM502. 

Figure 22: HPLC chromatogram of the filtered culture medium supernatant of the strain 
SPM507. 
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Figure 23: HPLC chromatogram of the filtered culture medium supernatant of the strain 
SPM508. 

Figure 24: HPLC chromatogram of the filtered culture medium supernatant of the strain 
SPM509. 

5 Figure 25: Alignment of the Orf3 protein (SEQ ID No. 29) with the TylB protein (SEQ 
ID No. 87) of S.fradiae produced using the FASTA program. 

Figure 26: Alignment of the MdmA protein of S. mycarofaciens (SEQ ID No. 88) with 

the SrmD protein (SEQ ID No. 16) produced using the FASTA program. 

Figure 27: Example of sequences of the residual sites after excision of the excisable 

10 cassette. Indicated in bold is the minimum att26 site as defined in Raynal et aL 9 1998. 
The sequence of phase 1 (attl) of 33 nucleotides is given in SEQ ID No. 104, the 
sequence of phase 2 (att2) of 34 nucleotides is given in SEQ ID No. 105, and the 
sequence of phase 3 (att3) of 35 nucleotides is given in SEQ ID No. 95. Figure 28: 
Diagrammatic representation of the orflO gene, location of the PCR primers used and 

15 construct obtained with each pair of primers. 

Figure 29: Construction of the cassette pac-oritT. 
Figure 30: Map of the cosmid pWED2. 

Figure 31: Diagrammatic representation of the group of genes involved in the 
biosynthetic pathway for spiramycins and of the location of the three probes used to 
20 isolate the cosmids of the genomic DNA library of the Streptomyces ambofaciens strain 
OSC2 in E. coli (cf. example 19). 

Figure 32: Location of the inserts of the cosmids isolated from the genomic DNA 
library of the Streptomyces ambofaciens strain OCS2 in E. coli (cf. example 19).New 
cosmid DNA library. Figure 33: Subcloning of the Pstl-Pstl fragment of the cosmid 
25 pSPM36 (insert of the plasmid pSPM58), subcloning of the Stul-Stul fragment of the 
cosmid pSPM36 (insert of the plasmid pSPM72) and subcloning of an EcoRl-Stul 
(insert of the plasmid pSPM73). 

Figure 34: Location of the open reading frames identified in the Pstl-Pstl insert of the 
plasmid pSPM58 and in the EcoRl-Stul insert of the plasmid pSPM73. 
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Figure 35: Superposition of the HPLC chromatograms of the filtered culture medium 

supernatant of the strain OS49.67 produced at 238 and 280 nm (top) and UV spectra of 

the molecules eluted at 33.4 minutes and 44.8 minutes (bottom). 

Figure 36: Molecular structure of the platenolide A and platenolide B molecules. 

Figure 37: Organization of the group of genes involved in the biosynthetic pathway for 

spiramycins. 

Figure 38: Molecular structure of a biosynthesis intermediate produced by the strain 
SPM507. 

Figure 39: Structure of a biosynthesis intermediate produced by a strain of 
5. ambofaciens of genotype orf6*::attlQhyg+ obtained from a strain which 
overproduces spiramycins. Insertion of the cassette attlQhyg+ into or/6* exerts a polar 
effect which prevents the expression of or/?*. 

Figure 40: Molecular structure of the platenolide A + mycarose and platenolide B + 
mycarose molecules produced by the strain OS49.67 

Figure 41: Subcloning of a Pstl-PstI fragment of the cosmid pSPM36 (insert of the 
plasmid (pSPM79)), localization of the open reading frames identified in the Pstl-PstI 
insert of the plasmid pSPM79 and localization of the sequence SEQ ID N° 140. 
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The present invention is illustrated using the following examples, which should 
be considered as nonlimiting illustrations. 

In summary, the genes of the biosynthetic pathway for spiramycins were 
isolated from a genomic DNA library of Streptomyces ambofaciens. This library was 
obtained by partial digestion of the genomic DNA of Streptomyces ambofaciens, with 
the BamUl restriction enzyme. Large DNA fragments, of 35 to 45 kb on average, were 
cloned into the cosmid pWEDl (Gourmelen et al, 1998) derived from the cosmid 
pWED15 (Wahl et aL, 1987). These cosmids were introduced into E. coli using phage 
particles. The library thus obtained was hybridized with a probe (of sequence SEQ ID 
No. 86) corresponding to a portion of the tylB gene of S. fradiae (Merson-Davies & 
Cundliffe, 1994, GenBank accession number: U08223). After hybridization, one cosmid 
out of the 4 which hybridized with the probe was more particularly selected. This 
cosmid, named pOS49.1, was then digested with Sac\ and a 3.3 kb fragment containing 
the region which hybridized with the probe was subcloned into the vector pUC19 and 
sequenced. Four open reading frames were identified and one of them encodes a protein 
(SEQ ID No. 29) exhibiting strong sequence similarity with the TylB protein of 
S. fradiae (SEQ ID No. 87) (cf. figure 25). This gene was named orf3 (SEQ ID No. 28) 
and was inactivated in S. ambofaciens. It was possible to demonstrate that the clones in 
which the or/3 gene was inactivated no longer produced spiramycins. This shows the 
involvement of the or/3 gene, or of genes located downstream, in spiramycin 
biosynthesis. 

Once this confirmation had been obtained, a larger region of the cosmid pOS49.1 
was sequenced, either side of the Sac\ fragment previously studied. Thus, it was 
possible to obtain, from the cosmid pOS49.1, the sequence of a region comprising 
seven whole open reading frames and two other incomplete open reading frames 
located on either side of these seven whole open reading frames. By virtue of a search 
in the databases, it was possible to show that one of the incomplete open reading frames 
corresponded to the srmG locus (a region encoding an enzyme called "polyketide 
synthase" (PKS)). The corresponding genes were subcloned by S. Burgett et al in 1996 
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(US patent 5,945,320). Moreover, the other open reading frames, the seven whole ORFs 
named: orfl, orf2, or/3, orff, orf5, or/6 and or/7 (SEQ ID No. 23, 25, 28, 30, 34, 36, 
40) and the start of the eighth ORF named or/8 (sequence SEQ ID No. 43) were not 
found in the databases. 

Subsequently, and with the aim of cloning other genes involved in spriramycin 
biosynthesis, other cosmids comprising fragments of the S. ambofaciens genome in this 
same region were isolated. For this, a further series of hybridizations on colonies was 
carried out using three probes. The first probe corresponds to a 3.7 kb DNA fragment 
containing orfl, or/2 and the start of or/3, subcloned from pOS49.1. The second probe 
corresponds to a 2 kb DNA fragment containing a portion of or/7 and a portion of or/8, 
subcloned from pOS49.1. A third probe was also used. This latter probe corresponds to 
a 1.8 kb DNA fragment containing the srmD gene. The srmD gene is a gene isolated 
from S. ambofaciens capable of imparting spiramycin resistance. Specifically, prior 
studies had enabled the cloning of several resistance determinants of S. ambofaciens, 
imparting spiramycin resistance to a strain of S. griseofuscus (spiramycin-sensitive 
strain) (Pemodet et al., 1993) (Pernodet et al, 1999). To isolate resistance genes, a 
cosmid library of the genomic DNA of the strain S. ambofaciens had been produced in 
the cosmid pKC505 (Richardson MA et al, 1987). This pool of cosmids had been 
introduced into S. griseofuscus, naturally sensitive to spiramycin. Five cosmids capable 
of imparting apramycin resistance and spiramycin resistance in S. griseofuscus had thus 
been obtained. Among these 5 cosmids, a cosmid named pOS44.1 contains, in its insert, 
the srmD gene which encodes a protein exhibiting a certain similarity with the protein 
encoded by the mdmA gene of Streptomyces mycarofaciens and involved in 
midecamycin resistance in this producer organism (Hara et al, 1990, GenBank 
accession number: A60725) (figure 26). The third probe used to locate the spiramycin 
biosynthesis genes was an insert of approximately 1 .8 kb comprising the srmD gene. 

These three probes were used to hybridize the genomic DNA library described 
above and made it possible to choose two cosmids (pSPM7 and pSPM5) liable to 
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contain the longest inserts and having no common bands. The cosmid pSMP5 
hybridized with the first and the second probe, but did not hybridize with the third 
probe, whereas the cosmid pSPM7 hybridized with the third probe only. These two 
cosmids were entirely sequenced using the "shotgut sequencing" technique. The 
sequences of the inserts of these two cosmids: pSPM7 and pSPM5, could be assembled 
since, although they did not overlap, each one of the inserts comprised a known 
sequence at one of its ends. Specifically, each one of these inserts comprised a fragment 
of the sequence of one of the genes encoding an enzyme called "polyketide synthase" 
(PKS). These 5 genes were cloned by S. Burgett et al in 1996 (US patent 5,945,320) 
(cf. figure 2). Thus, it was possible to determine a single S. ambofaciens genomic DNA 
sequence. A sequence of 30 943 nucleotides starting, in the 5' position, from an EcoRl 
site located in the first PKS gene and ranging up to a BamHl site in the 3' position is 
given in SEQ ID No. 1. This sequence corresponds to the region upstream of the PKS 
genes (cf. figure 2 and 3). A second region of 11 171 nucleotides, starting from a Pstl 
site in the 5' position and ranging up to a BstEll site in the 3' position, located in the 
fifth PKS gene, is given in SEQ ID No. 2. This region is the region downstream of the 
PKS genes (downstream and upstream being defined by the orientation of the 5 PKS 
genes or oriented in the same direction) (cf. figures 2 and 3). 

Next, and with the aim of cloning other genes involved in spiramycin 
biosynthesis, other cosmids comprising fragments of the S. ambofaciens genome in this 
same region were isolated (cf. example 18 and 19). 
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EXAMPLE 1: Construction of a genomic DNA library of the Streptomyces 
ambofaciens strain ATCC23877 in E. coli 

1.1. Extraction of the genomic DNA of the Streptomyces ambofaciens strain 
ATCC23877. 

5 The Streptomyces ambofaciens strain ATCC23877 (accessible in particular 

from the American Type Culture Collection (ATCC) (Manassas, Virginia, USA), under 
the number 23877) was cultured in YEME (Yeast Extract-Malt Extract) (Kieser, T, et 
aL, 2000) and the genomic DNA of this strain was extracted and purified according to 
standard techniques of lysis and precipitation (Kieser T. et aL, 2000). 

10 1 .2. Construction of the genomic DNA library 

The genomic DNA of the Streptomyces ambofaciens strain ATCC23877 as 
isolated above was partially digested with the BamHl restriction enzyme so as to obtain 
DNA fragments having a size between approximately 35 and 45 kb. These fragments 
were cloned into the cosmid pWEDl (Gourmelen et aL, 1998) digested beforehand 

15 with BamHl. The cosmid pWEDl is derived from the cosmid pWE15 (Wahl, et aL, 
1987) by deletion of the 4.1 kb Hpal-Hpal fragment containing the expression module 
which is active in mammals (Gourmelen et aL, 1998). The ligation mixture was then 
encapsidated in vitro in lambda phage particles using the "Packagene® Lambda DNA 
packaging system" marketed by the company Promega, according to the manufacturer's 

20 recommendations. The phage particles obtained were used to infect the SURE® strain 
of E. coli marketed by the company Stratagene (LaJolla, California, USA). The clones 
were selected on LB medium + ampicillin (50 ng/ml) since the cosmid pWEDl imparts 
ampicillin resistance. 
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EXAMPLE 2: Isolation and characterization of genes involved in spiramycin 
biosynthesis in Streptomyces ambofaciens 

2.1 Colony hybridization of the K coli clones of the genomic library of Strevtomvces 
ambofaciens ATCC23877 

Approximately 2 000 E. coli clones of the library obtained above were 
transferred onto a filter for colony hybridization. The probe used for the hybridization is 
an Nael-Nael DNA fragment (SEQ ID No. 86) comprising a portion of the tylB gene of 
Streptomyces fradiae. This fragment corresponds to nucleotides 2663 to 3702 of the 
DNA fragment described by L.A. Merson-Davies & E. Cundlifife (L.A. Merson-Davies 
& E. Cundliffe, 1994, GenBank accession number: U08223), in which the coding 
sequence of the tylB gene corresponds to nucleotides 2677 to 3843. 

The Nael-Nael DNA fragment carrying a portion of the tylB gene of Streptomyces 
fradiae (SEQ ID No. 86) was labeled with 32 P using the random priming technique (Kit 
marketed by the company Roche) and used as a probe to hybridize the 2 000 clones of 
the library, transferred onto a filter. The membranes used are Hybond N nylon 
membranes marketed by the company Amersham (Amersham Biosciences, Orsay, 
France) and the hybridization was carried out at 55°C in the buffer described by Church 
& Gilbert (Church & Gilbert, 1984). A wash was carried out in 2X SSC at 55°C for a 
period of 15 minutes and two successive washes in 0.5X SSC were then carried out at 
55°C, each for a period of 15 minutes. Under these hybridization and washing 
conditions, 4 clones out of the 2 000 hybridized exhibited a strong hybridization signal. 
These 4 clones were cultured in LB medium + ampicillin (50 ng/ml) and the 
corresponding 4 cosmids were extracted by standard alkaline lysis (Sambrook et al. 9 
1989). It was then verified that the hybridization was indeed due to a DNA fragment 
present in the insert of these four cosmids. For this, the cosmids were digested 
independently with several enzymes (BamHl, Pstl and Sacl). The digestion products 
were separated on agarose gel, transferred onto a nylon membrane and hybridized with 
the Nael-Nael DNA fragment comprising a portion of the tylB gene of Streptomyces 
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fradiae (cf. above) under the same conditions as above. It was possible to validate the 
four cosmids and one of these cosmids was more particularly selected and named 
pOS49.1. 

2.2 Verification of the involvement of the identified region and sequencing of the insert 
5 of the cosmid pOS49.1 

Several fragments of the insert of the cosmid pOS49.1 were subcloned and their 
sequences were determined. The cosmid pOS49.1 was digested with the Sad enzyme 
and it was shown, by Southern blotting, under the conditions described above, that a 

3.3 kb fragment contains the region which hybridizes with the tylB probe. This 3.3 kb 
fragment was isolated by electroelution from a 0.8% agarose gel, and then cloned into 
the vector pUC19 (GenBank accession number: M77789) and sequenced. The plasmid 
thus obtained was named pOS49.11. Four open reading frames exhibiting a codon use 
typical of Streptomyces could be identified in this fragment (two complete and two 
truncated open reading frames), using the FramePlot program (J. Ishikawa & K. Hotta, 
1999). Sequence comparisons using the FASTA program (cf. (W.R. Pearson & 
D.JLipman, 1988) and (W.R. Pearson, 1990), accessible in particular from the 
INFOBIOGEN resource center, Evry, France) made it possible to show that the protein 
deduced from one of these four open reading frames exhibits strong sequence 
similarities with the TylB protein (SEQ ID No. 87; GenBank accession number: 
U08223) of S. fradiae (cf. figure 25). This protein was named Orf3 (SEQ ID No. 29). 

With the aim of testing whether the corresponding gene (the or/3 gene (SEQ ID 
No. 28)) was involved in spiramycin biosynthesis in S. ambofaciens, this gene was 
interrupted with an £2hyg cassette (M-H. Blondelet-Rouault et aL, 1997, GenBank 
accession number: X99315). For this, the plasmid pOS49.1 1 was digested with the Xhol 
25 enzyme and the fragment containing the four open reading frames (two complete and 
two truncated, including or/3 in its entirety) was subcloned into the Xhol site of the 
vector pBC SK+ marketed by the company Stratagene (LaJolla, California, USA). The 
plasmid thus obtained was named pOS49.12. For the purpose of inactivating or/3, a 
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Pmll-BstEn fragment inside or/3 was replaced with the Qhyg cassette by blunt-ended 
cloning into the latter plasmid. For this, the plasmid pOS49.12 was digested with the 
Pmtl and BstEll enzymes, the unique site for which is in the coding sequence of the 
or/3 gene. The ends of the fragment corresponding to the vector were blunt-ended by 
treatment with the Klenow enzyme (DNA polymerase I large fragment). The Qhyg 
cassette was obtained by digestion, with the BamWl enzyme, of the plasmid pHP45 
Qhyg (Blondelet-Rouault et al, 1997, GenBank accession number: X99315). The 
fragment corresponding to the Qhyg cassette was recovered on agarose gel and its ends 
were blunt-ended by treatment with the Klenow enzyme. The two blunt-ended 
fragments thus obtained (the Qhyg cassette and the plasmid pOS49.12) were ligated and 
the ligation product was used to transform E. coli bacteria. The plasmid thus obtained 
was named pOS49.14 and contains the or/3 gene interrupted with the Qhyg cassette. 

The insert of the plasmid pOS49.14, in the form of an Xhol-Xhol fragment the 
ends of which were made non-sticky by treatment with the Klenow enzyme, was cloned 
into the EcoRV site of the plasmid pOJ260 (the plasmid pOJ260 is a conjugative 
plasmid capable of replicating in E. coli but incapable of replicating in S. ambofaciens 
(M. Bierman et al., 1992). This plasmid imparts apramycin resistance in E. coli and 
Streptomyces). The plasmid obtained (insert of the plasmid pOS49.14 cloned into the 
plasmid pOJ260) was named pOS49.16. The latter was transferred into the 
S. ambofaciens strain ATCC23877 by conjugation using the conjugated E. coli strain 
SI 7-1, as described by Mazodier et al. (Mazodier et al., 1989). Thef. coli strain SI 7-1 
is derived from the E. coli strain 294 (Simon et al., 1983) (Simon, et al, 1986). It was 
possible to obtain transconjugant clones possessing the hygromycin resistance marker 
carried by the Qhyg cassette and having lost the apramycin resistance marker carried by 
the vector pOJ260. For this, after conjugation, the clones were selected for their 
hygromycin resistance. The hygromycin-resistant clones were then subcultured 
respectively on medium with hygromycin (antibiotic B) and on medium with apramycin 
(antibiotic A) (cf. figure 9). The clones resistant to hygromycin (HygR) and sensitive to 
apramycin (ApraS) are, in principle, those in which a double recombination event has 
occurred and which therefore possess the or/3 gene interrupted with the Qhyg cassette. 
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Replacement of the wild-type copy of or/3 with the interrupted copy was verified by 
two successive hybridizations. Thus, the total DNA of the clones obtained was digested 
with various enzymes, separated on agarose gel, transferred onto a membrane and 
hybridized with a probe corresponding to the Qhyg cassette (cf. above) in order to 
verify the presence of the cassette in the genomic DNA of the clones obtained. A 
second hybridization was carried out using, as probe, the XhoVXhol of the plasmid 
pOS49.11 containing the four open reading frames (two complete and two truncated, 
including or/3 in its entirety). The verification of the genotype can also be carried out 
by any method known to those skilled in the art, and in particular by PCR using the 
appropriate oligonucleotides and sequencing of the PCR product. One of the 
or/3:: Qhyg clones thus obtained, and the genotype of which was verified, was chosen 
and was called OS49.16. 

The spiramycin production of the OS49.16 clone thus obtained was tested using 
the production test described below (cf. example 15). It was thus possible to 
demonstrate that this strain no longer produces spiramycin, confirming the involvement 
of or/3 and/or of the genes located downstream, such as, for example, or/4, in 
spiramycin biosynthesis. 

Once this confirmation had been obtained, a larger region of the cosmid pOS49.1 
was sequenced, on either side of the Sacl fragment previously studied. Thus, it was 
possible to obtain, from the cosmid pOS49.1, the sequence of a region comprising 
seven whole open reading frames and two other incomplete open reading frames, 
located on either side of these seven open reading frames. By virtue of a search in the 
databases, it was possible to show that one of the incomplete open reading frames 
corresponded to the srmG locus (a region encoding an enzyme called "polyketide 
synthase" (PKS)). The corresponding genes were cloned by S. Burgett et al in 1996 
(US patent 5,945,320). Moreover, the other open reading frames: the seven whole ORFs 
named: orfl, or/2, orJ3 t orf4, orf5, orf6 and or/7 (SEQ ID Nos 23, 25, 28, 30, 34, 36 
and 40) and the start of the eighth ORF named or/8 (the whole sequence of this orf is 
given in SEQ ID No. 43), were not found in the databases. 
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EXAMPLE 3: Isolation and characterization of other genes involved in 
spiramycin biosynthesis in Streptomyces ambofaciens 

Secondly, and with the aim of cloning other genes involved in spiramycin 
biosynthesis, other cosmids comprising fragments of the S. ambofaciens genome in this 
same region were isolated. For this, a further series of colony hybridizations was carried 
out using three probes: 

- The first probe used corresponds to a 3.7 kb BamHl-Pstl DNA fragment containing a 
fragment of the PKS gene (the genes corresponding to PKS were cloned by S. Burgett 
et ah in 1996 (US patent 5,945,320)), orfl, or/2 and the start of or/3, subcloned from 
pOS49.1 and ranging from a BamHl site located 1 300 base pairs upstream of the EcoRl 
site defining position 1 of SEQ ID No. 1 up to the Pstl site located at 2472 (SEQ ID 
No. 1). This BamHl-Pstl fragment was subcloned, from pOS49.1, into the plasmid pBC 
SK+, which made it possible to obtain the plasmid pOS49.28. 

- The second probe used corresponds to a Pstl-BamHl DNA fragment of approximately 
2 kb containing a fragment of orf7 and of or/8, subcloned from pOS49.1 and ranging 
from a Pstl site located at position 6693 of SEQ ID No. 1 up to the BamHl site located 
at position 8714 of SEQ ID No. 1. This Pstl-BamHl fragment was subcloned, from 
pOS49.1 into the plasmid pBC SK+, which made it possible to obtain the plasmid 
pOS49.76. 

- A third probe was also used. This corresponds to a 1.8 kb EcoRl-Hindlll DNA 
fragment containing the srmD gene. The srmD gene is a gene isolated from 
S, ambofaciens capable of imparting spiramycin resistance. Specifically, prior studies 
had enabled the cloning of several resistance determinants of 5. ambofaciens, imparting 
spiramycin resistance to a strain of S. griseofuscus (spiramycin-sensitive strain) 
(Pernodet et al. 9 1993) (Pernodet et aL % 1999). To isolate resistant genes, a cosmid 
library of the genomic DNA of the S. ambofaciens strain ATCC23877 had been 
prepared in the cosmid pKC505 (M.A. Richardson et al., 1987). For this, the genomic 
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DNA of the S. ambofaciens strain ATCC23877 had been partially digested with Sau3A\ 
so as to obtain fragments having a size of between approximately 30 and 40 kb. The 
genomic DNA thus digested (3 \ig) had been ligated with 1 |ig of pKC505 digested 
beforehand with the BarnUl enzyme (Pernodet et aL, 1999). The ligation mixture had 
5 then been encapsidated in vitro in phage particles. The phage particles obtained had 
been used to infect the E. coli strain HB101 (accessible in particular from the American 
Type Culture Collection (ATCC) (Manassas, Virginia, USA), under the number 33694). 
Approximately 20 000 apramycin-resistant E. coli clones had been pooled and the 
cosmids of these clones had been extracted. This pool of cosmids had been introduced 

10 by protoplast transformation into the S. griseofuscus strain DSM 10191 (K.L. Cox & 
R.H. Baltz, 1984), naturally sensitive to spiramycin (R.N. Rao et aL 9 1987, this strain is 
available in particular from the German Collection of Microorganisms and Cell 
Cultures (Deutsche Sammlung von Mikro-organismen und Zellkulturen GmbH, 
DSMZ), (Braunschweig, Germany), under the number DSM 10191). The transformants 

15 had been selected on a medium containing apramycin. 1 300 of the clones growing on 
medium containing apramycin had been transferred onto medium containing 5 fig/ml of 
spiramycin. Several apramycin-resistant clones had also grown on medium containing 
spiramycin and the cosmids of these colonies had been extracted and used to transform 
E. coli and S. griseofuscus (Pernodet et al. 9 1999). Five cosmids capable of imparting 

20 apramycin resistance to E. coli and co-resistance to apramycin and spiramycin in 
S. griseofuscus have thus been obtained. Among these 5 cosmids, it was determined 
that a cosmid named pOS44.1 contains, in its insert, a gene (SEQ ID No. 15) which 
encodes a protein (SEQ ID No. 16) exhibiting a certain similarity with a protein 
encoded by the mdmA gene of Streptomyces mycarofaciens (SEQ ID No. 88); this gene 

25 was named srmD (cf. alignment shown in figure 26, carried out using the FASTA 
program (cf.. (W.R. Pearson & DJ. Lipman, 1988) and (W.R. Pearson, 1990), 
accessible in particular from the INFOBIOGEN resource center, Evry, France)). 

To isolate the resistance determinant contained in the plasmid pOS44.1, the latter 
was partially digested with the Sau3A\ restriction enzyme so as to obtain fragments 
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having a size of approximately 1.5 to 3 kb, and these fragments were ligated into the 
vector pIJ486 linearized with the BamUl enzyme (Ward et aL, 1986). A plasmid was 
selected for its ability to impart spiramycin resistance in the S. griseqfuscus strain DSM 
10191 (R.N. Rao et aL, 1987), naturally sensitive to spiramycin (cf. above). For this, the 
pool of plasmids corresponding to the Sau3Al fragment of pOS44.1 ligated into the 
vector pIJ486 (cf. above) was introduced by protoplast transformation into the strain 
DSM 10191 and the transformants were selected for their thiostrepton resistance (due to 
the tsr gene carried by pIJ486). The clones growing on medium containing thiostrepton 
were transferred onto medium containing spiramycin. Several thiostrepton-resistant 
clones also grew on medium containing spiramycin and the plasmids of these colonies 
were extracted. A plasmid imparting resistance and containing an approximately 1.8 kb 
insert was selected and named pOS44.2. This 1.8 kb insert can be excised easily by 
virtue of a Hindlll site and an EcoRl site present in the vector on either side of the 
insert. This 1.8 kb Hindlll-EcoRl insert was sequenced and the resistance gene that it 
contains was named srmD. This fragment containing the srmD gene could thus be 
easily subcloned into the vector pUC19 (GenBank accession number: M77789) opened 
with EcoRl-Hindlll, and the plasmid obtained was named pOS44.4. The 1.8 kb Hindlll- 
EcoRl insert, containing the srmD gene, of this plasmid was used as a probe to locate 
the spiramycin biosynthesis genes (cf. below). 

A sample of an Escherichia Coli DH5a strain containing the plasmid pOS44.4 
was deposited with the Collection Nationale de Cultures de Microorganismes [National 
Collection of Cultures and Microorganisms] (CNCM) Pasteur Institute, 25, rue du 
Docteur Roux 75724 Paris Cedex 15, France, on July 10, 2002, under the registration 
number 1-2918. 

Approximately 2 000 clones of the library, obtained above (cf. example 1), were 
transferred onto a filter for colony hybridization according to conventional techniques 
(Sambrook etal, 1989). 
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The three probes described above were labeled with 32 P using the random priming 
technique (Kit marketed by Roche) and used to hybridize the 2 000 clones of the 
library, transferred onto the filter. The hybridization was carried out at 65°C in the 
buffer described by Church & Gilbert (Church & Gilbert, 1984). A wash was carried 
out in 2X SSC at 65°C for 15 minutes and two successive washes were then carried out 
in 0.5X SSC at 65°C, each for a period of 15 minutes. Under these hybridization and 
washing conditions, 16 clones out of the 2 000 hybridized exhibited a strong 
hybridization signal with at least one of the probes. However, no cosmid hybridized 
with the three probes. The 16 cosmids were extracted and digested with the BamHl 
restriction enzyme. Comparison of the restriction profiles of these various cosmids with 
one another led to two cosmids being chosen which were liable to contain the longest 
inserts of the region and which had no common bands. Thus, two cosmids, one named 
pSPM5 and the other pSPM7, were chosen. The cosmid pSPM5 hybridized with the 
probes orfl to or/4 and the probe or/8, but did not hybridize with the probe srmD. 
pSPM7 hybridized only with the probe srmD and not with the other two probes. 

These two cosmids were entirely sequenced using the "shotgun sequencing" 
technique. The sequences of the inserts of these two cosmids: pSPM7 and pSPM5, 
could be assembled since, although they did not overlap, each one of the inserts 
comprised a known sequence at one of its ends. Specifically, each one of these inserts 
comprised a fragment of the sequence of one of the genes encoding an enzyme called 
"polyketide synthase" (PKS). These 5 genes were cloned by S. Burgett et al in 1996 
(US patent 5,945,320) (cf. figure 2). Thus, it was possible to determine a single 
S. ambofaciens genomic DNA sequence. A sequence of 30 943 nucleotides beginning, 
in the 5' position, from an EcoRl site located in the first PKS gene and ranging up to a 
BamHI site in the 3' position is given in SEQ ID N° 1. This sequence corresponds to 
the region upstream of the PKS genes (cf. figure 2 and 3). A second region of 1 1 171 
nucleotides, starting from a Pstl site in the 5' position and ranging up to an Ncol site in 
the 3* position located in the fifth PKS gene, is given in SEQ ID No. 2. This second 
region is the region downstream of the PKS genes (downstream and upstream being 
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defined by the orientation of the 5 PKS genes all oriented in the same direction) (cf. 
figure 2 and 3). 



EXAMPLE 4: Analysis of the nucleotide sequences, determination of the open 
5 reading frames and characterization of the genes involved in spiramycin 
biosynthesis 

The sequences obtained were analyzed using the FramePlot program (J. Ishikawa 
& K. Hotta 1999). This made it possible to identify, among the open reading frames, the 
open reading frames exhibiting a codon use typical of Streptomyces. This analysis made 

10 it possible to determine that this region comprises 35 ORFs located on either side of 
five genes encoding the enzyme "polyketide synthase" (PKS). 10 and 25 ORFs were 
identified respectively downstream and upstream of these genes (downstream and 
upstream being defined by the orientation of the 5 PKS genes all oriented in the same 
direction) (cf. figure 3). Thus, the 25 open reading frames of this type, occupying a 

15 region of approximately 3 1 kb (SEQ ID No. 1 and figure 3), were identified upstream of 
the 5 PKS genes and 10 occupying a region of approximately 11.1 kb (SEQ ID No. 2 
and figure 3) were identified downstream of the PKS genes. The genes of the upstream 
region were named orfl, orf2, orJ3, orf4, orf5, orf6, or/7, orf8 t orf9c, or/10, orfllc, 
orfl 2, orfl 3c, orfl 4, orfl 5c, orfl 6, orfl 7, orfl 8, orfl 9, orf20, orf21c f orf22c, orf23c, 

20 orf24c and orf25c (SEQ ID Nos 23, 25, 28, 30, 34, 36, 40, 43, 45, 47, 49, 53, 60, 62, 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84). The genes of the downstream region were 
named orfl*c, orf2*c, or/3*c, orff*c, orf5* orf6* orJ7*c, or/8* or/9* and or/70* 
(SEQ ID Nos 3, 5, 7, 9, 1 1, 13, 15, 17, 19 and 21). The "c" added to the name of the 
gene means, for the ORF in question, that the coding sequence is in the reverse 

25 orientation (the coding strand is therefore the strand complementary to the sequence 
given in SEQ ID No. 1 or SEQ ID No. 2 for these genes) (cf. figure 3). 

The protein sequences deduced from these open reading frames were compared 
with those present in various databases using various programs: BLAST (Altschul et 
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al„ 1990) (Altschul et ai, 1997), CD-search, COGs (Cluster of Orthologous Groups) 
(these three programs are accessible in particular from the National Center for 
Biotechnology Information (NCBI) (Bethesda, Maryland, USA)), FASTA ((W.R. 
Pearson & D. J. Lipman, 1988) and (W.R. Pearson, 1990), BEAUTY (K.C. Worley et 
5 a/., 1995)), (these two programs are accessible in particular from the INFOBIOGEN 
resource center, Evry, France). These comparisons made it possible to formulate 
hypotheses regarding the function of the products of these genes and to identify those 
which may be involved in spiramycin biosynthesis. 



10 EXAMPLE 5: Gene inactivation: principle of the construction of a knocked-out 
strain of Streptomyces ambofaciens 

The methods used consist in carrying out a gene replacement. The target gene to be 
interrupted is replaced with a copy of this gene interrupted with a cassette imparting 
resistance to an antibiotic (for example apramycin or hygromycin), as illustrated in 
15 figure 9. The cassettes used are bordered on either side by translation termination 
codons in all the reading frames and by transcription terminators which are active in 
Streptomyces. 

The insertion of the cassette into the target gene may or may not be accompanied by 
a deletion in this target gene. The size of the regions flanking the cassette can range 
20 from a few hundred to several thousand base pairs. 

The constructs required for the inactivation of the gene with the cassette were 
obtained in E. coli, the reference organism for obtaining recombinant DNA constructs. 
The interrupted gene was obtained in a plasmid which replicates in E. coli but which 
cannot replicate in Streptomyces. 

25 The constructs were then subcloned into vectors so as to allow the transformation 
and the inactivation of the desired gene in S. ambofaciens. For this, two plasmids were 
used: 
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- pOJ260 (M. Bierman et al, 1992) (cf. example 2) . which imparts apramycin 
resistance in E. coli and Streptomyces and which was used when the target gene 
was interrupted with a cassette imparting hygromycin resistance. 

- pOSK1205 (4726 bp). This plasmid derives from the plasmid pBK-CMV 
(marketed by the company Stratagene (LaJolla, California, USA)) in which an 
Avrll fragment containing the sequence encoding Neomycin/Kanamycin 
resistance has been replaced with a sequence encoding hygromycin resistance, 
while at the same time conserving the P SV40 promoter. For this, the plasmid 
yUVAS-Qhyg (Blondelet-Rouault et aL, 1997) was digested with the Notl and 
Pflml enzymes and the fragment conferring hygromycin resistance was 
subcloned into the ^vrll site of the vector pBK-CMV after all the ends had been 
blunt-ended by treatment with the Klenow enzyme. In pOSK1205, the cassette 
which confers hygromycin resistance is preceded by the pSV40 promoter. This 
plasmid confers hygromycin resistance in E. coli and Streptomyces and was used 
when the target gene was interrupted with a cassette conferring apramycin 
resistance. 

The cassettes were introduced into the target gene either by cloning using restriction 
sites present in the target gene, of by recombination between short identical sequences 
as described, for example, by Chaveroche et al (M.K. Chaveroche et al. 9 2000). 

The plasmid carrying the gene interrupted by the cassette can then be introduced into 
Streptomyces ambofaciens, for example by conjugation between E. coli and 
Streptomyces (P. Mazodier et al 9 1989). This technique was used when the basic vector 
is the vector pOJ260. A second technique may be used: the technique of protoplast 
transformation after denaturation of the DNA by alkali treatment (T. Kieser et al. 9 
2000), in order to increase the frequency of recombination as described, for example, by 
Oh & Chater (Oh & Chater, 1997). This technique was used when the basic vector is 
pOJ260 or pOSK1205 (cf. below). The transformants are then selected with the 
antibiotic corresponding to the cassette present in the target gene (cf. figure 9, antibiotic 
B). A mixture of clones, among which there has been integration by a single or by two 
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recombination events, is thus selected. Next, the clones sensitive to the antibiotic for 
which the resistance gene is present in the vector (outside the recombination cassette) 
(cf. figure 9, antibiotic A) are sought. It is thus possible to select the clones for which 
there have, in principle, been two recombination events resulting in the replacement of 
5 the wild-type gene with the copy interrupted by the cassette. These steps are shown 
diagrammatically in figure 9. 

Several cassettes can be used to interrupt the target genes. The Qhyg cassette which 
confers hygromycin resistance (Blondelet-Rouault et al. 9 1997, GenBank accession 
number: X99315) can, for example, be used. 

10 

EXAMPLE 6: Construction of a strain of Streptomyces ambofaciens with an in- 
phase knockout in the or/3 gene 

The or/ 3 gene had been interrupted with the Qhyg cassette (cf. example 2.2) and 
it was possible to demonstrate that an or/3:: £2hyg strain no longer produces spiramycin, 

15 confirming the involvement of one or more genes of the cloned region in spiramycin 
biosynthesis (cf. example 2.2). In view of their orientation, cotranscription of ORFs 1 to 
7 is probable (cf. figure 3) and the phenotype observed (non-producer of spiramycins) 
may be due to the inactivation of one or more of the genes cotranscribed with or/3. To 
confirm the involvement of or/3 in spiramycin biosynthesis, a further inactivation of the 

20 or/3 gene was carried out, the latter inactivation being carried out in phase. For this, a 
Dralll fragment inside or/3 of 504 base pairs was deleted. A DNA fragment obtained 
from pOS49.1, ranging from the EcoRl site located at position 1 (SEQ ID No. 1) up to 
the Sad site located at position 5274 (SEQ ID No. 1), and which comprises a deletion 
between the two Z>aIII sites at positions 2563 and 3067 (504 nucleotides removed), 

25 was cloned into the plasmid pOJ260 (M. Bierman et aL, 1992). The plasmid thus 
obtained was named pOS49.67. 

The insert of pOS49.67 therefore consists of a DNA fragment of 5. ambo/aciens 
containing the or/1 gene, or/2 gene, or/3 gene with the in-phase deletion, or/4 gene and 
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a portion of or/5. The vector in which this insert was subcloned is pOJ260, the plasmid 
pOS49.67 therefore imparts apramycin resistance and was introduced into the strain 
OS49.16 by protoplast transformation (cf. example 2). Since the strain OS49.16 is 
resistant to hygromycin, hygR and apraR transformants were obtained. After passaging 
such clones twice on nonselective medium, the clones sensitive to apramycin and to 
hygromycin (apraS and hygS) were sought. In some of these clones, a recombination 
event between homologous sequences is in fact expected to lead to the replacement of 
the copy of or/3 interrupted with the Qhyg cassette (contained in the genome of the 
strain OS49.16) with the copy of or/3 with the in-phase deletion present on the vector. 
The clones resulting from this recombination are expected to be apraS and hygS after 
elimination of the vector sequences. The genotype of the strains thus obtained can be 
verified by hybridization or by PCR and sequencing of the PCR product (to verify that 
only one in-phase deleted copy of or/3 is present in the genome of the clones obtained). 
Clones having only the copy of or/3 with an in-phase deletion were thus obtained and 
their genotype was verified. A clone exhibiting the desired characteristics was more 
particularly selected and was named OS49.67. 

A sample of the strain OS49.67 was deposited with the Collection Nationale de 
Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux 
75724 Paris Cedex 15, France, on July 10, 2002, under the registration number 1-2916. 

EXAMPLE 7: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the or/8 gene 

To carry out the inactivation of the or/8 gene, a construct in which the Hhyg 
cassette was introduced into the coding sequence of or/8 was obtained. For this, the 
plasmid pOS49.88 was first of all constructed. The plasmid pOS49.88 is derived from 
the plasmid pUC19 (GenBank accession number: M77789) by insertion of a 3.7 kb 
fragment (Pstl-EcoRl fragment obtained from the cosmid pSPM5) containing the end of 
or/7, or/8 and the beginning of or/9, cloned into the Pstl-EcoRl sites of pUC19. The 
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Qhyg cassette (in the form of a BamWl fragment blunt-ended by treatment with the 
Klenow enzyme) was cloned into the unique Sail site of pOS49.88 located in or/8, after 
blunt-ending of all the ends by treatment with the Klenow enzyme. 

Since the cloning was blunt-ended, two types of plasmid were obtained depending 
on the direction of insertion of the cassette: pOS49.106 in which the hyg and or/8 genes 
are in the same orientation, and pOS49.120 in which the hyg and or/8 genes are in 
opposite orientations. The insert of the plasmid pOS49.106 was then subcloned into the 
plasmid pOJ260, to give pOS49.107. For this, the plasmid pOS49.106 was digested 
with the Asp718l enzyme and the ends were blunt-ended by treatment with the Klenow 
enzyme; this digestion product was redigested with the Pstl enzyme and the fragment 
containing the or/8 gene into which the Qhyg cassette was inserted was cloned into the 
vector pOJ260 (cf. above). For this, the vector pOJ260 was digested with the EcoKV 
and Pstl enzymes and used for the ligation. This manipulation therefore makes it 
possible to obtain an oriented ligation since each one of the two fragments is blunt- 
ended on one side and Pstl on the other. The plasmid obtained was named pOS49.107. 

The plasmid pOS49.107 was intoduced into the S. ambofaciens strain ATCC23877 
by protoplast transformation (T. Kieser, et al. 9 2000). After protoplast transformation, 
the clones are selected for their hygromycin resistance. The hygromycin-resistant clones 
are then subcultured respectively on medium with hygromycin (antibiotic B) and on 
medium with apramycin (antibiotic A) (cf. figure 9). The clones resistant to hygromycin 
(HygR) and sensitive to apramycin (ApraS) are in principle those in which a double 
crossing over event has occurred and which contain the or/8 gene interrupted with the 
Qhyg cassette. The replacement of the wild-type copy of or/8 with the copy interrupted 
by the Qhyg cassette was verified by Southern blotting. Thus, the total DNA of the 
clones obtained was digested with several enzymes, separated on agarose gel, 
transferred onto a membrane and hybridized with a probe corresponding to the Qhyg 
cassette in order to verify the presence of the cassette in the genomic DNA of the clones 
obtained. A second hybridization was carried out using as probe the Pstl-EcoRI insert 
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containing the end of or/7, or/8 and the beginning of or/9, approximately 3.7 kb in size, 
of the plasmid pOS49.88. The verification of the genotype can be carried out by any 
method known to those skilled in the art, and in particular by PCR using the appropriate 
oligonucleotides and sequencing of the PCR product. 

5 An or/8::Qhyg clone was chosen and named OS49.107. A sample of the strain 

OS49.107 was deposited with the Collection Nationale de Cultures de Microorganismes 
(CNCM) Pasteur Institute, 25, rue du Docteur Roux 75724 Pans Cedex 15, France, on 
July 1 0, 2002, under the registration number 1-291 7. 

10 EXAMPLE 8: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the orflO gene 

A 1.5 kb DNA fragment inside the orflO gene was obtained by PCR using as 
matrix the genomic DNA of S. ambofaciens and the following primers: 

SRMRl : 5' CTGCCAGTCCTCTCCCAGCAGTACG 3' (SEQ ID No . 89) 
15 SRMR2 : 5' TGAAGCTGGACGTCTCCTACGTCGG 3' (SEQ ID No . 90) 

This PCR-derived DNA fragment was cloned into the vector pCR2.1 marketed by the 
company Invitrogen (Carlsbad, California, USA). The plasmid thus obtained was 
named pOS49.32. The Qhyg cassette (in the form of a BamHl fragment, cf. above) was 
cloned into the unique BstEll site inside the fragment of the orflO gene, after all the 

20 ends had been blunt-ended by treatment with the Klenow enzyme. Since the cloning 
was blunt-ended, two types of plasmid were obtained depending on the direction of 
insertion of the cassette: pOS49.43 in which the hyg and orflO genes are in the same 
orientation and pOS49.44 in which the hyg and orflO genes are in opposite orientations. 
The insert of the plasmid pOS49.43 was transferred (in the form of an Aspl\%\-Xba\ 

25 fragment the ends of which were blunt-ended by treatment with the Klenow enzyme) 
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into the EcoKV site of the plasmid pOJ260, which made it possible to obtain the 
plasmid pOS49.50. The plasmid pOS49.50 containing the fragment of the or/10 gene 
interrupted with the Qhyg cassette was introduced into the Streptomyces ambofaciens 
strain ATCC23877. After transformation, the clones were selected for their hygromycin 
resistance. The hygromycin-resistant clones were then subcultured respectively on 
medium with hygromycin (antibiotic B) and on medium with apramycin (antibiotic A) 
(cf. figure 9). The clones resistant to hygromycin (HygR) and sensitive to apramycin 
(ApraS) are in principle those in which a double crossing over event has occurred and 
which contain the or/10 gene interruped with the Qhyg cassette. Clones which 
contained the hygromycin resistance marker carried by the cassette and which had lost 
the apramycin resistance marker carried by the vector pOJ260 were thus obtained. The 
event consisting of replacement of the wild-type copy of or/10 with the orflOwQhyg 
interrupted copy was verified by Southern blotting. Thus, the total genomic DNA of the 
clones obtained was digested with several enzymes, separated on agarose gel, 
transferred onto a membrane and hybridized with a probe corresponding to the Qhyg 
cassette in order to verify the presence of the cassette in the genomic DNA of the clones 
obtained. A second hybridization was carried out using as probe the 1.5 kb PCR 
product in the orflO gene (cf. above). 

A clone exhibiting the expected characteristics (or/J O:\Qhyg) was more particularly 
selected and named OS49.50. It was in fact possible to verify, by virtue of two 
hybridizations, that the Qhyg cassette was indeed present in the genome of this clone 
and that the expected digestion profile is indeed obtained in the case of replacement, 
subsequent to a double recombination event, of the wild-type gene with the copy 
interrupted with the Qhyg cassette, in the genome of this clone. The verification of the 
genotype can also be carried out by any method known to those skilled in the art, and in 
particular by PCR using the appropriate oligonucleotides and sequencing of the PCR 
product. 
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EXAMPLE 9: Gene inactivation: principle of the construction of a strain of 
Streptomyces ambofaciens knocked out according to the "excisable cassette 9 ' 
technique (cf. figures 9 and 10) 

A second type of cassette can be used for the gene inactivation: cassettes termed 
"excisable cassettes". These cassettes have the advantage of being able to be excised in 
Streptomyces by a site-specific recombination event after having been introduced into 
the genome of S. ambofaciens. The aim is to inactivate certain genes in strains of 
Streptomyces without leaving, in the final strain, selection markers or large DNA 
sequences not belonging to the strain. After excision, only a short sequence of 
approximately thirty or so base pairs (called "scar" site) remains in the genome of the 
strain (cf. figure 10). 

The implementation of this system consists, firstly, in replacing the wild-type copy 
of the target gene (by virtue of two homologous recombination events, cf. figure 9) with 
a construct in which an excisable cassette has been inserted into this target gene. The 
insertion of this cassette is accompanied by a deletion in the target gene (cf. figure 9). 
Secondly, the excision of the excisable cassette from the genome of the strain is brought 
about. The excisable cassette functions by virtue of a site-specific recombination system 
and has the advantage of allowing Streptomyces mutants which do not ultimately carry 
a resistance gene to be obtained. Possible polar effects on the expression of the genes 
located downstream of the inactivated gene(s) are also avoided (cf. figure 10). 

The employment of excisable cassettes has been described and used in many 
organisms including mammalian cells and yeast cells and in E. coli (Bayley et aL, 1992; 
Brunelli and Pall, 1993; Camilli et aL, 1994; Dale and Ow, 1991; Russell et aL, 1992; 
Lakso et aL, 1992). These excisable cassettes all use the Cre site-specific recombinase 
which acts on lox sites. This recombination system originates from the PI 
bacteriophage. 
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To construct a system of the "excisable cassette" type in Streptomyces, the site- 
specific recombination system for the mobile genetic element pSAM2 of Streptomyces 
ambofaciens (Boccard et al, 1989a and b) is exploited. The system set up consists, 
initially, in constructing a recombinant vector comprising the gene to be interrupted, 
5 into which an excisable cassette is inserted. The insertion into the target gene of the 
excisable cassette is accompanied by a deletion in the target gene. It can be carried out 
by cloning using restriction sites present in the target gene or by recombination between 
short identical sequences as described, for example, by Chaveroche et al (Chaveroche 
MK et al, 2000). The excisable cassette can be constructed using, for example, the 

10 Qhyg cassette (Blondelet Rouault et al, 1997). This cassette was bordered by attR and 
attL sequences which normally flank the integrated copy of pSAM2 (cf. figure 15). The 
attL and attR sequences contain all the sites required for the site-specific recombination 
allowing excision of pSAM2 or of any DNA fragment located between these two 
regions (Sezonov et al, 1997, Raynal et al, 1998). The construction of such a cassette is 

15 obviously not limited to the use of an Qhyg cassette, but other resistance cassettes may 
be used as a basis to construct this cassette (for example the Qaac or Qvph cassette 
(Blondelet Rouault et al, 1997)). 

After having obtained this construct, the strain of Streptomyces is transformed with 
the recombinant plasmid. The transformants are then selected with the antibiotic 

20 corresponding to the cassette present in the target gene (cf. figure 9, antibiotic B, this 
involves, for example, selection with hygromycin if the excisable cassette derives from 
the Qhyg cassette). A mixture of clones among which there has been integration by a 
single or by two recombination events is thus selected. Subsequently, the clones 
sensitive to the antibiotic for which the resistance gene is present in the vector (outside 

25 the recombination cassette) (cf. figure 9, antibiotic A) are sought. It is thus possible to 
select the clones for which there have in principle been two recombination events 
resulting in the replacement of the wild-type gene with the copy interrupted with the 
cassette. These steps are shown diagrammatically in figure 9; the genotype of the clones 
thus obtained is verified by Southern blotting and a strain exhibiting the desired 
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characteristics (replacement of the wild-type gene with the . copy interrupted with the 
excisable cassette) is selected. 

Next, the strain selected above is transformed with a plasmid which allows 
expression of the xis and int genes, which are both necessary for the site-specific 
recombination between the attR and attL sites. This recombination leads to the 
excisable cassette leaving the genome of the strain, by virtue of a recombination event 
(cf. figure 10) (Raynal et aL, 1998). It is advantageous to choose the vector carrying the 
xis and int genes from the vectors which are relatively stable in Streptomyces (for 
example derived from the Streptomyces vector pWHM3, (Vara et al. 9 1989)), this 
makes it possible to obtain a strain which has lost the latter vector after a few 
sporulation cycles in the absence of selection pressure. 

To excise the cassette, it is possible, for example, to use the plasmid pOSV508 
(cf. figure 14), which is introduced by protoplast transformation into the strain of 
S. ambofaciens containing a gene interrupted with the excisable cassette. The plasmid 
pOSV508 is derived from the plasmid pWHM3 (J. Vara et a/., 1989) (cf. figure 13) to 
which the xis and int genes of pSAM2 (F. Boccard et at., 1989b) have been added, 
placed under the control of the ptrc promoter (E. Amann et al. 9 1988). The xis and int 
genes placed under the control of the ptrc promoter were subcloned into the plasmid 
pWHM3 from the plasmid pOSint3 (Raynal et ai, 1998) (cf. figure 14). Introduction of 
the plasmid pOSV508 carrying the xis and int genes of pSAM2, into the mutant strain, 
will allow effective excision, by site-specific recombination, of the excisable cassette 
between the attL and attR sites flanking this cassette (A. Raynal et al. 9 1998) (figure 
10). Among the transformants, selected for their thiostrepton resistance due to the tsr 
gene carried by pOSV508, those which have become sensitive to the antibiotic to which 
the presence of the cassette imparts resistance are chosen (cf. figure 10). The excision is 
effective and it was observed that more than 90% of the transformants are of this type. 
After one or more cycles of growth and sporulation on a solid medium lacking 
thiostrepton, clones which have lost the plasmid pOSV508 are obtained. These clones 
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can be detected by their sensitivity to thiostrepton. The sequence of the deleted target 
gene can be verified by PCR and sequencing of the PCR product. 

Finally, the resulting strain carries, in the inactivated gene (internal deletion for 
example), a "scar" att site corresponding to the minimum attB site (Raynal et al, 1998), 
5 derived from the recombination between the attR and attL sites. This minimum attB site 
which remains is similar to that naturally present in strains of Streptomyces 
ambofaciens, Streptomyces pristinaespiralis and Streptomyces lividans (Sezonov et al. 9 
1997). 

The gene whose inactivation is desired can be cotranscribed with other genes located 
10 downstream. To avoid the inactivation of one of the genes having a polar effect on the 
expression of the genes downstream in the operon, it is important to obtain an in-phase 
deletion after excision of the cassette. The excisable cassette system as described above 
makes it possible to satisfy such a requirement. Those skilled in the art may, in fact, 
easily construct three different excisable cassettes, said cassettes leaving, after excision, 
15 a sequence of 33, 34 or 35 nucleotides respectively, without a stop codon whatever the 
reading frame. Knowing the sequence of the target gene and the size of the deletion 
associated with the insertion of the cassette, it is possible to choose between these three 
excisable cassettes so that the excision produces an in-phase deletion. Of the 33, 34 or 
35 nucleotides added, 26 correspond to the minimum attB sequence (cf. figure 27). 

20 In the case of the present application, two excisable cassettes were used. These two 
cassettes are as follows: attlQhyg+ (SEQ ID No. 91) and att3Qaac- (SEQ ID No. 92); 
these cassettes leave respectively 33 and 35 nucleotides after excision. They comprise 
respectively the Qhyg cassette or the Qaac cassette, the + and - signs corresponding to 
the orientation of the resistance cassette. These two cassettes were constructed and 

25 cloned into the EcoRV site of the vector pBC SK+, the HindiW site of which was 
deleted beforehand. The plasmids obtained were named pattl£2hyg+ and patt3Qaac-, 
respectively. The excisable cassettes can be easily taken out by digestion of the plasmid 
with EcoRV. 



124 



EXAMPLE 10: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the orf2 gene 

The inactivation of the or/2 gene was carried out using the excisable cassette 
technique (cf. above). The starting strain used is the Streptomyces ambofaciens strain 
OSC2 which derives from the strain ATCC23877. However, the strain OSC2 differs 
from the strain ATCC23877 in that it has lost the mobile genetic element pSAM2 
(Boccard et al. 9 1989a and b). This mobile element can be lost spontaneously during 
protoplastization (action of the lysozyme to digest the bacterial wall and fragment the 
mycelium (Kieser et al. 9 2000)) and regeneration of the protoplasts of the strain 
ATCC23877. To select the clones which had lost the pSAM2 element, a screen was set 
up, based on repression of the pra gene by the KorSA transcription repressor (Sezonov 
et al., 1995) (G. Sezonov et al, 2000). For this, a DNA fragment containing the 
promoter of the pra gene placed upstream of the aph gene (imparting kanamycin 
resistance and lacking its own promoter) was cloned into the unstable vector 
pWHM3Hyg, the latter deriving from the plasmid pWHM3 (Vara et al, 1989) in which 
the tsr gene has been replaced with the hyg gene (imparting hygromycin resistance). 
The plasmid thus obtained was named pOSV510. The strain ATCC23877 was 
transformed, after protoplastization, with the plasmid pOSV510. The Pra promoter is a 
promoter which is repressed by the KorSA repressor, the gene encoding the latter being 
located within the pSAM2 mobile element (Sezonov G. et al, 2000). After 
transformation with the plasmid pOSV510, the transformed bacteria are selected for 
their kanamycin resistance (due to the aph gene carried by pOSV510). The clones 
having lost the pSAM2 integrative element have lost the KorSA repressor and the Pra 
promoter is no longer repressed and allows expression of the aph kanamycin resistance 
gene. Selection with kanamycin after transformation with the plasmid pOSV510 
therefore makes it possible to select the clones which have lost the pSAM2 integrative 
element (and therefore KorSA) and which contain the plasmid pOSV510. Since the 
plasmid pOSV510 is unstable, after a few cycles of sporulation without antibiotic, 
isolated clones are subcultured on medium with kanamycin, on medium with 
hygromycin and on medium without antibiotic. The clones sensitive to kanamycin and 
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to hygromycin have lost pOSV510. The loss of the pSAM2 element was verified by 
hybridization and PCR. A clone exhibiting the desired characteristics was selected and 
was named OSC2. 

A sample of the strain OSC2 was deposited with the Collection Nationale de 
5 Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux 
75724 Paris Cedex 15, France, on July 10, 2002, under the registration number 1-2908. 

The inactivation of the orf2 gene was carried out using the excisable cassette 
technique (cf. above and figure 10). For this, a 4.5 kb insert, the sequence of which 
starts from the EcoRI site located at position 1 up to the BamUl site located at position 
10 4521 (SEQ ID No. 1), was subcloned into the EcoRI and BamHl sites of the plasmid 
pUC19 (GenBank accession number: M77789) from the cosmid pSPM5. The plasmid 
thus obtained was named pOS49.99. 

This plasmid was introduced into the E. coli strain KS272, which already 
contained the plasmid pKOBEG (Chaveroche et al. 9 2000) (cf. figure 12). 

15 In parallel, the att3Qaac- excisable cassette (SEQ ID No. 92, cf. above) was 

amplified by PCR using as matrix' the plasmid pOSKl 102 (the plasmid pOSKl 102 is a 
plasmid derived from the vector pGP704Not (Chaveroche et al, 9 2000) (V.L. Miller & 
J.J. Mekalanos, 1988) in which the att3Qaac- cassette has been cloned, as an EcoRV 
fragment into the unique EcoRV site of pGP704Not) and using the following primers: 

20 

ORF2A 

5 ' CCCGCGCGGCAGCCTCTCCGTGATCGAGTCCGGCGTGACCATCGCGCGCGCTTCGTTCGG- 3 ' 
(SEQ ID No. 93) 
ORF2B 

25 5 ' GCTCCGTGCGTCATGCAGGAAGGTGTCGTAGTCGCGGTAGATCTGCCTCTTCGTCCCGAA - 3 ' 
(SEQ ID No. 94) 

The 40 deoxynucleotides located at the 5' end of these oligonucleotides 
comprise a sequence corresponding to a sequence in the target gene {prf2 in the present 
case) and the 20 deoxynucleotides located in the most 3' position (shown in bold 
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above) correspond to the sequence of one of the ends of the attiQaac- excisable 
cassette (cf. figure 1 1). 

The PCR product thus obtained was used to transform the E. coli strain 
containing the plasmids pKOBEG and pOS49.99 as described (Chaveroche et al, 2000) 
(cf. figure 12). Thus, the bacteria were transformed by electroporation and selected for 
their apramycin resistance. The plasmids of the clones obtained were extracted and 
digested with several restriction enzymes, for the purpose of verifying that the digestion 
profile obtained corresponds to the profile expected if there has been insertion of the 
cassette (att3Qaac-) into the target gene (prf2\ i.e. if there has indeed been homologous 
recombination between the ends of the PCR product and the target gene (Chaveroche et 
al., 2000). The verification of the construct can also be carried out by any method 
known to those skilled in the art, and in particular by PCR using the appropriate 
oligonucleotides and sequencing of the PCR product. A clone in which the plasmid has 
the expected profile was selected and the corresponding plasmid was named pSPM17. 
This plasmid derives from pOS49.99 in which or/2 is interrupted with the apramycin 
cassette (cf. figure 12). 

The insertion of the cassette is accompanied by a deletion in orf2, between the 
nucleotides 21 1 and 492 of the coding portion of orf2. 

The plasmid pSPM17 was digested with the £coRI enzyme and the ends were then 
blunt-ended by treatment with the Klenow enzyme; this digestion product was then 
digested with the Xbal enzyme and the insert containing the deleted orf2 gene was 
cloned into the vector pOSK1205 (cf. above). For this, the vector pOSK1205 was 
digested with the BamHl enzyme and the ends were then blunt-ended by treatment with 
the Klenow enzyme; this product was then digested with the Xbal enzyme and used for 
the ligation with the insert obtained from pSPM17 as above. This manipulation 
therefore makes it possible to obtain an oriented ligation since each one of the two 
fragments is blunt-ended on one side and Xbal on the other. The plasmid thus obtained 
was named pSPM21; it carries a hygromycin resistance gene (vector portion) and an 
insert in which the deleted orf2 gene is replaced with the att3Qaac- cassette. 
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The vector pSPM21 was introduced into the Streptomyces ambofaciens strain OSC2 
(cf. above) by protoplast transformation (T. Kieser et ai 9 2000). After transformation, 
the clones were selected for their apramycin resistance. The apramycin-resistant clones 
were then subcultured respectively on medium with apramycin (antibiotic B) and on 
medium with hygromycin (antibiotic A) (cf. figure 9). The clones resistant to apramycin 
(ApraR) and sensitive to hygromycin (HygS) are in principle those in which a double 
crossing over event has occurred and which contain the or/2 gene interrupted with the 
att3Qaac- cassette. These clones were selected and the replacement of the wild-type 
copy of orf2 with the copy interrupted with the cassette was verified. The presence of 
the att3Qaac- cassette was verified by colony PCR. A hybridization was also carried 
out. For this, the total DNA of the clones obtained was digested with several enzymes, 
separated on agarose gel, transferred onto a membrane and hybridized with a 3 kb probe 
corresponding to an EcoRl-BamHl fragment of the insert of the plasmid pOS49.99 
(cf. above). The verification of the genotype can also be carried out by any method 
known to those skilled in the art, and in particular by PCR using the appropriate 
oligonucleotides and sequencing of the PCR product. 

A clone exhibiting the expected characteristics was selected and named SPM21. It 
was possible to verify, using PCR and hybridization, that the att3Qaac- cassette was 
indeed present in the genome of this clone and that the digestion profile expected in the 
case of replacement, subsequent to a double recombination event, of the wild-type gene 
with the copy interrupted with the att3Qaac- cassette in the genome of this clone is 
indeed obtained. This clone therefore has the genotype: orf2::att3Qaac-. 

A sample of the strain SPM21 was deposited with the Collection Nationale de 
Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux 
75724 Paris Cedex 15, France, on July 10, 2002, under the registration number 1-2914. 

The strain SPM21 was transformed with the vector pOSV508 by protoplast 
transformation in order to bring about excision of the cassette (cf. figure 14). The 
plasmid pOSV508 is derived from the plasmid pWHM3 (J. Vara et a/., 1989) 
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(cf. figure 13) in which the xis and int genes of pSAM2 (F. Boccard et al, 1989b) have 
been added, placed under the control of the ptrc promoter (E. Amann et al., 1988) 
(cf. figure 14). The introduction, into the strain SPM21, of the plasmid pOSV508 
carrying the xis and int genes of pSAM2 allows effective excision, by site-specific 
5 recombination, of the excisable cassette between the attL and attR sites flanking this 
cassette (A. Raynal et aL, 1998) (figure 10). Among the transformants selected for their 
thiostrepton resistance due to the tsr gene carried by pOSV508, those which have 
become sensitive to apramycin, the resistance gene for which is carried by the 
att3f2aac- cassette, are chosen; the excision in fact leads to the loss of this resistance 

10 gene (cf. figure 10). The plasmid pOSV508 is unstable and, after two successive 
passages on medium without antibiotic, isolated clones are subcultured on medium with 
thiostrepton and on medium without thiostrepton. The thiostrepton-sensitive clones 
have lost pOSV508. It was verified that the excision of this cassette indeed results in an 
in-phase deletion in the or/2 gene, by PCR and sequencing of the PCR product; the 

15 excision of the cassette in fact leaves a characteristic "scar" att3 sequence (which is 
similar to the attB site of origin after recombination between the attL and attR sites): 

5' ATCGCGCGCGCTTCGTTCGGGACGAAGAGGTAGAT 3' (SEQ ID No. 95). 

The strain thus obtained which has the desired genotype (orf2::att3) was named 
SPM22. 

20 A sample of the strain SPM22 was deposited with the Collection Nationale de 

Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux 
75724 Paris Cedex 15, France, July 10, 2002, under the registration number 1-2915. 
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EXAMPLE 11: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the or/12 gene 

For the inactivation of or/12, orfl3c and or/14, the same starting plasmid 
(pSPM504) was used to introduce a cassette of the "excisable cassette" type, at various 
5 positions. This plasmid contains a 15.1 kb insert which corresponds to the region from 
or/7 to or/17. To construct this plasmid, a 15.1 kb BgRl fragment originating from 
digestion of the cosmid pSPM7 (cf. above) was cloned into the plasmid pMBL18 
(Nakano et aL 9 1995) digested with Bamlih Since BamHl and Bglll ends are 
compatible, the plasmid pSPM502 is obtained after ligation. The entire insert of 
10 pSPM502 was then subcloned (in the form of a HindVWNhel fragment) into the plasmid 
pOSK1205 (digested with HindllUNhel), which made it possible to obtain the plasmid 
pSPM504. 

This plasmid was introduced into the E. coli strain KS272 which already 
contained the plasmid pKOBEG (Chaveroche et ai 9 2000) (cf. figure 12). 

15 In parallel, the att3Qaac- excisable cassette was amplified by PCR using as 

matrix the plasmid pOSKl 102 (the plasmid pOSKl 102 is a plasmid derived from the 
vector pGP704Not (Chaveroche et al, 2000) (V.L. Miller & JJ. Mekalanos, 1988) in 
which the att3Qaac- cassette has been cloned, as an EcoRV fragment, into the unique 
EcoKV site of pGP704Not); the primers used are as follows: 

20 EDR8 : 5 ' CGGGATGATCGCTTGTCCGGCGGCCGGATGCCTAGCCTCATCGCGCGCGCTTCGTTCGG 3 ' 
(SEQ ID No. 96) 

EDR9 : 5 ' CCCGATCCAGAACGTCTGGTCGGTGATCAGGTCGCTGTTCATCTGCCTCTTCGTCCCGAA 
3' (SEQ ID No. 97) 

25 The 40 (only 39 for EDR8) deoxynucleotides located at the 5' end of these 

oligonucleotides comprise a sequence corresponding to a sequence in the target gene 



130 



(orfl2 in the present case) and the 20 deoxynucleotides located in the most 3' position 
(shown in bold above) correspond to the sequence of one of the ends of the att3Qaac- 
excisable cassette (cf. figure 1 1). 

The PCR product thus obtained was used to transform the E. coli strain KS272 
containing the plasmids pKOBEG and pSPM504 (cf. above), as described by 
Chaveroche et al (Chaveroche et aL, 2000) (cf figure 12 for the principle, the plasmid 
pOS49.99 should be replaced by the plasmid pSPM504 and the plasmid obtained is no 
longer pSPM17 but pSPM507). Thus, the bacteria were transformed by electroporation 
with this PCR product and the clones were selected for their apramycin resistance. The 
plasmids of the clones obtained were extracted and digested with several restriction 
enzymes for the purpose of verifying that the digestion profile obtained corresponds to 
the profile expected if there has been insertion of the cassette (att3Qaac-) into the target 
gene (orfl2\ i.e. if there has indeed been homologous recombination between the ends 
of the PCR product and the target gene (Chaveroche et al, 2000). The verification of 
the construct can also be carried out by any method known to those skilled in the art, 
and in particular by PCR using the appropriate oligonucleotides and sequencing of the 
PCR product. A clone in which the plasmid has the expected profile was selected and 
the corresponding plasmid was named pSPM507. This plasmid derives from pSPM504 
in which or/12 is interrupted with the att3Qaac- cassette (cf. figure 12). The insertion 
of the cassette is accompanied by a deletion in the orfl2 gene, the interruption begins at 
the thirtieth codon of orfJ2. The last 46 codons of orfl2 remain after the cassette. 

The vector pSPM507 was introduced into the Streptomyces ambofaciens strain 
OSC2 (cf. above) by protoplast transformation (T. Kieser et al, 2000). After 
transformation, the clones were selected for their apramycin resistance. The apramycin- 
resistant clones were then subcultured respectively on medium with apramycin 
(antibiotic B) and on medium with hygromycin (antibiotic A) (cf figure 9). The clones 
resistant to apramycin (ApraR) and sensitive to hygromycin (HygS) are in principle 
those in which a double crossing over event has occurred and which contain the orfl2 
gene interrupted with the att3Qaac- cassette. These clones were more particularly 
selected and the replacement of the wild-type copy of orfl2 with the copy interrupted 



131 



with the cassette was verified by hybridization. Thus, the total DNA of the clones 
obtained was digested with several enzymes, separated on agarose gel, transferred onto 
a membrane and hybridized with a probe corresponding to the att3Qaac- cassette in 
order to verify the presence of the cassette in the genomic DNA of the clones obtained. 
A second hybridization was carried out using as probe a DNA fragment obtained by 
PCR and corresponding to a very large portion of the coding sequence of the orfl2 
gene. 

The verification of the genotype can also be carried out by any method known to 
those skilled in the art, and in particular by PCR using the appropriate oligonucleotides 
and sequencing of the PCR product. 

A clone exhibiting the expected characteristics (orf!2::att3Qaac-) was more 
particularly selected and named SPM507. It was in fact possible to verify, by virtue of 
the two hybridizations, that the att 3 Qaac- cassette was indeed present in the genome of 
this clone and that the digestion profile expected in the case of replacement, subsequent 
to a double recombination event, of the wild-type gene with the copy interrupted with 
the att3Qaac- cassette in the genome of this clone is indeed obtained. This clone 
therefore has the genotype: orfl2::att3Qaac- and was named SPM507. Given the 
orientation of the genes (cf. figure 3), there is no need to excise the cassette to study the 
effect of the inactivation of or/72. Specifically, the fact that orf!3c is oriented in the 
opposite direction to orf!2 shows that these genes are not cotranscribed. The use of an 
excisable cassette makes it possible, on the other hand, to have the possibility of being 
rid of the selection marker at any moment, in particular by transformation of the 
plasmid pOSV508. A sample of the strain SPM507 was deposited with the Collection 
Nationale de Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du 
Docteur Roux 75724 Paris Cedex 15, France, on July 10, 2002 under the registration 
number 1-2911. 
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EXAMPLE 12: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the orfl3c gene 

The att3Qaac- excisable cassette was amplified by PCR using as matrix the 
plasmid pOSKl 102 (cf. above), using the following primers: 

5 

EDR3 : 5 ' ACCGGGGCGGTCCTCCCCTCCGGGGCGTCACGGCCGCGGAATCTGCCTCTTCGTCCCGAA 
3' (SEQ ID No. 98) 

EDR4 : 5 ' CACGCAGCGAGCCGACGCACTGATGGACGACACGATGGCCATCGCGCGCGCTTCGTTCGG 
3' (SEQ ID No. 99) 

10 

The 40 deoxynucleotides located at the 5' end of these oligonucleotides comprise a 
sequence corresponding to a sequence in the target gene (orfl3c in the present case) and 
the 20 deoxynucleotides located in the most 3' position (shown in bold above) 
correspond to the sequence of one of the ends of the att3Qaac- excisable cassette 
15 (cf. figure 11). 

The PCR product thus obtained was used to transform the E, coli strain KS272 
containing the plasmids pKOBEG and pSPM504 (cf. above), as described by 
Chaveroche et al (Chaveroche et al, 2000) (cf. figure 12 for the principle, the plasmid 
pOS49.99 should be replaced by the plasmid pSPM504 and the plasmid obtained is no 

20 longer pSPM17 but pSPM508). Thus, the bacteria were transformed by electroporation 
with this PCR product and the clones were selected for their apramycin resistance. The 
plasmids of the clones obtained were extracted and digested with several restriction 
enzymes for the purpose of verifying that the digestion profile obtained corresponds to 
the profile expected if there has been insertion of the cassette (att3Qaac-) into the target 

25 gene {prfl3c\ i.e. if there has indeed been homologous recombination between the ends 
of the PCR product and the target gene (Chaveroche et al. 9 2000). The verification of 
the construct can also be carried out by any method known to those skilled in the art, 
and in particular by PCR using the appropriate oligonucleotides and sequencing of the 
PCR product. A clone in which the plasmid has the expected profile was selected and 
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the corresponding plasmid was named pSPM508. This plasmid derives from pSPM504 
in which orfl3c is interrupted with the apramycin cassette (cf. figure 12). The insertion 
of the cassette is accompanied by a deletion in the orfl3c gene, the interruption begins 
at the sixth codon of orfl3c. The last 3 codons of orfl3c remain after the cassette. 

The vector pSPM508 was introduced into the Streptomyces ambofaciens strain 
OSC2 (cf. above) by protoplast transformation (T. Kieser et al. 9 2000). After 
transformation, the clones were selected for their apramycin resistance. The apramycin- 
resistant clones were then subcultured respectively on medium with apramycin 
(antibiotic B) and on medium with hygromycin (antibiotic A) (cf. figure 9). The clones 
resistant to apramycin (ApraR) and sensitive to hygromycin (HygS) are in principle 
those in which a double crossing over event has occurred and which contain the orfl3c 
gene interrupted with att3Qaac- cassette. These clones were more particularly selected 
and the replacement of the wild-type copy of orfl3c with the copy interrupted with the 
cassette was verified by hybridization. Thus, the total DNA of the clones obtained was 
digested with several enzymes, separated on agarose gel, transferred onto a membrane 
and hybridized with a probe corresponding to the att3Qaac- cassette in order to verify 
the presence of the cassette in the genomic DNA of the clones obtained. A second 
hybridization was carried out using as probe a PCR product corresponding to a 
sequence extending by about a hundred base pairs upstream and downstream of the 
coding sequence of orfl 3c. The verification of the genotype can also be carried out by 
any method known to those skilled in the art, and in particular by PCR using the 
appropriate oligonucleotides and sequencing of the PCT product. 

A clone exhibiting the expected characteristics {orfl 3 c:\att3Qaac-) was more 
particularly selected and named SPM508. It was in fact possible to verify, by virtue of 
the two hybridizations, that the att3Qaac- cassette was indeed present in the genome of 
this clone and that the digestion profile expected in the case of replacement, subsequent 
to a double recombination event, of the wild-type gene with the copy interrupted with 
the att3Qaac- cassette in the genome of this clone is indeed obtained. This clone 
therefore has the genotype: orfl3c::att3Qaac- and was named SPM508. Given the 
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orientation of the genes (cf. figure 3), there is no need to excise the cassette to study the 
effect of the inactivation of orfl3c. The fact that orfl4 is oriented in the opposite 
direction to orflSc shows that these genes are not cotranscribed. The use of an excisable 
cassette makes it possible, on the other hand, to add the possibility of being rid of the 
5 selection marker at any moment. A sample of the strain SPM508 was deposited with the 
Collection Nationale de Cultures de Microorganismes (CNCM) Pasteur Institute, 25, 
rue du Docteur Roux 75724 Paris Cedex 15, France, on July 10, 2002 under the 
registration number 1-2912. 

10 EXAMPLE 13 Construction of a strain of Streptomyces ambofaciens with a 
knockout in the orfl4 gene 

The att3Qaac- excisable cassette was amplified by PCR using as matrix the 
plasmid pOSKl 102 (cf. above), using the following primers: 

EDR5 : 5 ' GGGCGTGAAGCGGGCGAGTGTGGATGTCATGCGAGTACTCATCGCGCGCGCTTCGTTCGG 
15 3' (SEQ ID No. 100) 

EDR6 : 5 ' CGGGAAACGGCGTCGCACTCCTCGGGGGCCGCGTCAGCCCATCTGCCTCTTCGTCCCGAA 
3' (SEQ ID No. 101) 



The 40 deoxynucleo tides located at the 5' end of these oligonucleotides comprise a 
20 sequence corresponding to a sequence in the target gene (prfl4 in the present case) and 
the 20 deoxynucleotides located in the most 3' position (shown in bold above) 
correspond to the sequence of one of the ends of the att3Qaac- excisable cassette 
(cf. figure 1 1). 
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The PCR product thus obtained was used to transform the E. coli strain KS272 
containing the plasmids pKOBEG and pSPM504 (cf. above), as described by 
Chaveroche et al. (Chaveroche et al, 2000) (cf. figure 12 for the principle, the plasmid 
pOS49.99 should be replaced by the plasmid pSPM504 and the plasmid obtained is no 
longer pSPM17 but pSPM509). Thus, the bacteria were transformed by electroporation 
with this PCR product and the clones were selected for their apramycin resistance. The 
plasmids of the clones obtained were extracted and digested with several restriction 
enzymes for the purpose of verifying that the digestion profile obtained corresponds to 
the profile expected if there has been insertion of the cassette (att3Qaac-) into the target 
gene (or/14), i.e. if there has indeed been homologous recombination between the ends 
of the PCR product and the target gene (Chaveroche et al, 2000). The verification of 
the construct can also be carried out by any method known to those skilled in the art, 
and in particular by PCR using the appropriate oligonucleotides and sequencing of the 
PCR product. A clone in which the plasmid has the expected profile was selected and 
the corresponding plasmid was named pSPM509. This plasmid derives from pSPM504 
in which orfl4 is interrupted with the apramycin cassette (cf. figure 12). The insertion 
of the cassette is accompanied by a deletion in the orfl4 gene, the interruption begins at 
the fourth codon of or/14. The final codon of or/74 remains after the cassette. 

The vector pSPM509 was introduced into the Streptomyces ambo/aciens strain 
OSC2 (cf. above) by protoplast transformation (T. Kieser et al, 2000). After 
transformation, the clones were selected for their apramycin resistance. The apramycin- 
resistant clones were then subcultured respectively on medium with apramycin 
(antibiotic B) and on medium with hygromycin (antibiotic A) (cf. figure 9). The clones 
resistant to apramycin (ApraR) and sensitive to hygromycin (HygS) are in principle 
those in which a double crossing over event has occurred and which contain the or/14 
gene interrupted with att3Qaac- cassette. These clones were more particularly selected 
and the replacement of the wild-type copy of or/14 with the copy interrupted with the 
cassette was verified by hybridization. Thus, the total DNA of the clones obtained was 
digested with several enzymes, separated on agarose gel, transferred onto a membrane 
and hybridized with a probe corresponding to the att3Qaac- cassette in order to verify 
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the presence of the cassette in the genomic DNA of the clones obtained. A second 
hybridization was carried out using as probe a PCR product corresponding to a 
sequence extending by about a hundred base pairs upstream and downstream of the 
coding sequence of or/14. The verification of the genotype can also be carried out by 
5 any method known to those skilled in the art, and in particular by PCR using the 
appropriate oligonucleotides and sequencing of the PCT product. 

A clone exhibiting the expected characteristics (orfl4::att3Qaac-) was more 
particularly selected and named SPM509. It was in fact possible to verify, by virtue of 
the two hybridizations, that the att3Qaac- cassette was indeed present in the genome of 

10 this clone and that the digestion profile expected in the case of replacement, subsequent 
to a double recombination event, of the wild-type gene with the copy interrupted with 
the att3Qaac- cassette in the genome of this clone is indeed obtained. This clone 
therefore has the genotype: orfl4::att3Qaac- and was named SPM509. Given the 
orientation of the genes (cf. figure 3), there is no need to excise the cassette to study the 

15 effect of the inactivation of orfl4. The fact that orflSc is oriented in the opposite 
direction to orfl4 shows that these genes are not cotranscribed. The use of an excisable 
cassette makes it possible, on the other hand, to add the possibility of being rid of the 
selection marker at any moment; A sample of the strain SPM509 was deposited with the 
Collection Nationale de Cultures de Microorganismes (CNCM) Pasteur Institute, 25, 

20 rue du Docteur Roux 75724 Paris Cedex 15, France, on July 10, 2002 under the 
registration number 1-2913. 



EXAMPLE 14: Construction of a strain of Streptomyces ambofaciens with a 
knockout in the orf6* gene 

25 The inactivation of the or/6* gene was carried out using the excisable cassette 

technique (cf. above and figure 10). For this, the cosmid pSPM7 was used as matrix to 
amplify a fragment of the orf6* gene using the following oligonucleotides: 
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C9583: 5' CTGCAGGTGCTCCAGCGCGTCGATCT 3' (oligo sense) (SEQ ID No. 102) 

C9584: 5' CTGCAGACGGAGGCGGACCTGCGGCT 3' (oligo antisense) (SEQ ID 
No. 103) 

The 20 deoxynucleotides located at the 3' end of these oligonucleotides 
5 correspond to a sequence located in the coding portion of the or/6* gene (SEQ ID 
No. 13) and the 6 deoxynucleotides located in the most 5' position correspond to the 
sequence of a Pstl site which facilitates subsequent cloning. The amplified DNA 
fragment is approximately 1.11 kb in size. This PCR product was cloned into the vector 
pGEM-T Easy (marketed by the company Promega (Madison, Wisconsin, USA)), 
10 which made it possible to obtain the plasmid named pBXLl 1 1 1 (cf. figure 1 6). 

The attl£2hyg+ excisable cassette was then introduced into the coding sequence 
of the orf6* gene. For this, the plasmid pBXLl 1 1 1 was digested with the Smal and 
Asp7\8l restriction enzymes and the digestion product was treated with the Klenow 
enzyme. This manipulation makes it possible to produce an internal deletion of 120 bp 

15 in the coding sequence of the or/6* gene (cf. figure 15). In addition, on either side of 
the restriction sites, there remain' respectively 511 bp and 485 bp of the sequence of 
or/6* which will allow the homologous recombination for the inactivation of the or/6* 
gene. The attl£2hyg+ excisable cassette was prepared from the plasmid pattl£1hyg+ 
(cf. above) by digestion of this plasmid with EcoRV. The latter was then subcloned into 

20 the vector pBXLllll prepared beforehand as described above (Smal and Aspl 1 81 
digestion and then treatment with the Klenow enzyme). The plasmid obtained was 
named pBXLl 1 12 (cf. figure 17). In this construct, the orf6* gene comprises a 120 bp 
deletion and is interrupted with the attlQhyg+ cassette. 

The plasmid pBXL1112 was then digested with the Pstl enzyme (site bordering 
25 the cassette since present in the PCR oligonucleotides) and the 3.7 kb Pstl insert 
comprising a portion of orf 6* interrupted with the attlQhyg+ cassette was then cloned 
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into the Pstl site of the plasmid pOJ260 (cf. above). The plasmid thus obtained was 
named pBXL1113. 

The vector pBXLl 113 was introduced into the Streptomyces ambofaciens strain 
OSC2 (cf. above) by protoplast transformation (T. Kieser et al. 9 2000). After 
5 transformation, the clones were selected for their hygromycin resistance. The 
hygromycin-resistant clones were then subcultured respectively on medium with 
hygromycin (antibiotic B) and on medium with apramycin (antibiotic A) (cf. figure 9). 
The clones resistant to hygromycin (HygR) and sensitive to apramycin (ApraS) are in 
principle those in which a double crossing over event has occurred and which contain 

10 the orf6* gene interrupted with the attlQhyg+ cassette. These clones were more 
particularly selected and the replacement of the wild-type copy of or/6* with the copy 
interrupted with the cassette was verified by the Southern blotting technique. Thus, the 
total DNA of the clones obtained was digested with several enzymes, separated on 
agarose gel, transferred onto a membrane and hybridized with a probe corresponding to 

15 the hyg gene (obtained by PCR) in order to verify the presence of the cassette in the 
genomic DNA of the clones obtained. A second hybridization was carried out using as 
probe the Pst\-Pst\ insert containing the orf6* gene, approximately 1.1 kb in size, and 
obtained from the plasmid pBXLl 1 1 1 (cf. above and figure 16). The verification of the 
genotype can also be carried out by any method known to those skilled in the art, and in 

20 particular by PCR using the appropriate oligonucleotides and sequencing of the PCR 
product. 

A clone exhibiting the expected characteristics (orf6*::attJ&hyg+) was more 
particularly selected and named SPM501 . It was in fact possible to verify, using the two 
hybridizations, that the attlQhyg+ cassette was indeed present in the genome of this 
25 clone and that the digestion profile expected in the case of replacement, subsequent to a 
double recombination event, of the wild-type gene with the copy interrupted with the 
attlf2hyg+ cassette in the genome of this clone is indeed obtained. This clone therefore 
has the genotype: orf6*::attl£2hyg+ and was named SPM501. A sample of the strain 
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SPM501 was deposited with the Collection Nationale de Cultures de Microorganismes 
(CNCM) Pasteur Institute, 25, rue du Docteur Roux 75724 Paris Cedex 15, France, on 
July 1 0, 2002, under the registration number 1-2909. 

The strain SPM501 was transformed with the vector pOSV508 by protoplast 
5 transformation in order to bring about excision of the cassette (cf. figure 14). The 
plasmid pOSV508 is derived from the plasmid pWHM3 (J. Vara et aL, 1989) (cf. figure 

13) in which the xis and int genes of pSAM2 (F. Boccard et aL, 1989b), placed under 
the control of the ptrc promoter (E. Amann et aL, 1988), have been added (cf. figure 

14) . The introduction, into the strain SPM501, of the plasmid pOSV508 carrying the xis 
10 and int genes of pSAM2 allows effective excision, by site-specific recombination, of 

the excisable cassette between the attL and attR sites flanking this cassette (A. Raynal 
et aL, 1998) (figure 10). Among the transformants, selected for their thiostrepton 
resistance due to the tsr gene carried by pOSV508, those which have become sensitive 
to hygromycin, for which the resistance gene is carried by the attlOhyg+ cassette, are 

15 chosen; the excision in fact leads to the loss of this resistance gene (cf. figure 10). The 
plasmid pOSV508 is unstable and, after two successive passages on medium without 
antibiotic, isolated clones are subeultured on medium with thiostrepton and on medium 
without thiostrepton. The thiostrepton-sensitive clones have lost pOSV508. It was 
verified that the excision of the cassette had indeed occurred in phase in the or/6* gene, 

20 by PCR and sequencing of the PCR product. The interruption begins at the 158th 
codon, 40 codons are deleted (120 bp), and the excision of the cassette leaves a 
characteristic "scar" attl sequence of 33 bp: 

5' ATCGCGCGCTTCGTTCGGGACGAAGAGGTAGAT 3 ' (SEQ ID No. 104). 

The strain thus obtained and having the desired genotype (or/6*::attJ) was named 
25 SPM502. 
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A sample of the strain SPM502 was deposited with the Collection Nationale de 
Cultures de Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux, 
75724 Paris Cedex 15, France, on July 10, 2002, under the registration number 1-2910. 



5 EXAMPLE 15: Analysis of the strains of Streptomyces ambofaciens with a 
knockout in the orf2, orJ3, orfS, or/10, orfl2 9 orfl3c, orfl4 or orf6* gene 

In order to test the spiramycin production of the various strains obtained, a 
microbiological test based on the sensitivity of a strain of Micrococcus luteus was 
developed (cf. (A. Gourmelen et al. 9 1998)). The strain of Micrococcus luteus used is a 

10 strain derived from the strain DSM1790 naturally sensitive to spiramycin (this strain is 
available in particular from the German Collection of Microorganisms and Cell 
Cultures (Deutsche Sammlung von Mikro-organismen und Zellkulturen GmbH, 
DSMZ), (Braunschweig, Germany), under the number DSM1790); the strain used in the 
present test differs from the strain DSM1790 in that it is resistant to congocidine. This 

15 strain is a spontaneous mutant obtained by selection on medium containing increasing 
doses of congocidine. Such a strain was selected due to the fact that Streptomyces 
ambofaciens produces both spiramycin and congocidine. Since the aim is to assay the 
spiramycin production of the various strains obtained using a microbiological test based 
on the sensitivity of a strain of Micrococcus luteus, it is necessary to have a 

20 congocidine-resistant strain. 

The various strains of Streptomyces to be tested were cultured in 500 ml 
Erlenmeyer flasks with baffles (baffled Erlenmeyers) containing 70 ml of MPS medium 
(Pernodet et aL, 1993). The baffled Erlenmeyers were inoculated at an initial 
concentration of 2.5 x 10 6 spores/ml of the various strains of S. ambofaciens and grown 
25 at 27°C with orbital shaking at 250 rpm. 2 ml samples of suspensions were taken after 
48, 72 and 96 hours of culturing, and centrifuged. The various supernatants were then 
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frozen at -20°C. A ten-fold dilution of these supernatants in sterile culture medium is 
used for the test (cf. figure 1 8). 

The Micrococcus luteus indicator strain resistant to congocidine but sensitive to 
spiramycin was cultured in 2TY medium (Sambrook et al. y 1989) containing 
5 congocidine at 5 ^g/ml, for 48h at 37°C. The optical density (OD) of the culture is 
measured and this culture is diluted so as to adjust the optical density to 4. 0.4 ml of this 
preculture is diluted in 40 ml of DAMS medium (Difco Antibiotic Medium 5, marketed 
by the company Difco), brought to a temperature of approximately 45°C beforehand. 
This medium is then poured into a 12 x 12 cm square dish and left to stand at ambient 
10 temperature. 

Once the medium had cooled and solidified, disks of Whatman AA paper 
(cf. A. Gourmelen et al, 1998), 12 mm in diameter, were soaked with 70 |il of the ten- 
fold dilution of each supernatant and placed on the surface of the dish. Disks soaked 
with a solution of spiramycin of various concentrations (2-4-8 ng/ml in MPS culture 
15 medium) are used as a standard range. The dishes were left at 4°C for 2 h so as to allow 
diffusion of the antibiotics into the agar, and were then incubated at 37°C for 24 to 
48 h. 

If the disk contains spiramycin, this diffuses into the agar and inhibits the growth 
of the Micrococcus luteus indicator strain. This inhibition creates a "halo" around the 

20 disk, this halo reflecting the area in which the Micrococcus luteus strain has not grown. 
The presence of this halo is therefore an indication of the presence of spiramycin and 
makes it possible to determine whether the strain of S. ambofaciens corresponding to 
the disk in question is or is not a spiramycin producer. Comparison with the inhibition 
diameters obtained for the standard range makes it possible to obtain an indication of 

25 the amount of spiramycin produced by this strain. 

The various strains described in the preceding examples were used in this test in 
order to detect their spiramycin production. The results obtained were grouped together 
in Table 38. 
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Table 38 



Strain 


Inactivated gene 


Example in 
which the 
strain is 
described 


Phenotype: Producer 
(+) or nonproducer (- 
) of spiramycin 


ATCC2387 
7 


None 


1 


(+) 


OS49.16 


orfiy.Wtyg 


2 


(-) 


OS49.67 


In-phase deletion orf3 


6 


o 


OS49.107 


orfSy.Qhyg 


7 


<-) 


OS49.50 


or/10:: Qhyg 


8 


<-> 


OSC2 


None 


10 


(+) 


SPM21 


orf2::att3Qa3ac- 


10 


(-> 


SPM22 


In-phase deletion orf2::att3 


10 


(-> 


SPM501 


orf6*::attlf2hyg+ 


14 


(-> 


SPM502 


In-phase deletion orf6*::attl 


14 


(+) 


SPM507 


orfl2::att3Qaac- 


11 




SPM508 


orfl 3c::att3C2aac- 


12 


(+) 


SPM509 


orfl 4: :att3Qaac- 


13 
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These results make it possible to draw a certain number of conclusions regarding 
the function of the various genes involved in the spiramycin biosynthesis. Thus, the 
or/3 gene is essential to spiramycin biosynthesis. Specifically, an in-phase inactivation 
in this gene leads to a strain (OS49.67, (cf. example 6)) which no longer produces 
5 spiramycin. The in-phase inactivation makes it possible to discard the hypothesis of a 
possible influence of the cassette introduced on the expression of the genes located 
downstream of or/3. 

Similarly, the or/8 and orflO genes encode proteins essential to spiramycin 
biosynthesis since the strains OS49.107 and OS49.50 have a nonproducer phenotype. In 
10 addition, in these latter two strains, it is clearly the inactivation of the corresponding 
gene which is responsible for this nonproducer phenotype, since, in view of the 
orientation of the various orfs (cf. figure 3), the construct introduced cannot have a 
polar effect. 

The study of the strains having an excisable cassette also makes it possible to 
15 draw a certain number of conclusions regarding the function of the interrupted gene. 
The strain SPM507 has the genotype: orfl2:;att3Qaac-. In view of the orientation of 
the genes (cf. figure 3), there is no point in excising the cassette to study the effect of 
the inactivation of orfl2. The fact that orfl3c is oriented in the opposite direction to 
orfl2 shows that these genes are not co transcribed. The use of an excisable cassette 
20 makes it possible, on the other hand, to have the possibility of being rid of the selection 
marker at any moment. The phenotype of the strain SPM507 is nonproducer; it may 
therefore be concluded therefrom that the or/12 gene is a gene essential to spiramycin 
biosynthesis in S. ambofaciens. 

The strain SPM508 has the genotype: orfl3c::att3£2aac-. In view of the 
25 orientation of the genes (cf. figure 3), there is no point in excising the cassette to study 
the effect of the inactivation of or/1 3c. The fact that or/14 is oriented in the opposite 
direction to or/1 3c shows that these genes are not cotranscribed. The use of an excisable 
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cassette makes it possible, on the other hand, to have the possibility of being rid of the 
selection marker at any moment. The phenotype of the strain SPM508 is producer; it 
may therefore be concluded therefrom that the orfl 3c gene is not a gene essential to 
spiramycin biosynthesis in S. ambofaciens. 

The strain SPM509 has the genotype: orfl4::att3Qaac-. In view of the orientation 
of the genes (cf. figure 3), there is no need to excise the cassette to study the effect of 
the inactivation of orfl4\ the fact that orflSc is oriented in the opposite direction to 
orfl 4 shows that these genes are not cotranscribed. The use of an excisable cassette 
makes it possible, on the other hand, to have the possibility of being rid of the selection 
marker at any moment. The phenotype of the strain SPM509 is nonproducer; it may 
therefore be concluded therefrom that the orfl 4 gene is a gene essential to spiramycin 
biosynthesis in S. ambofaciens. 

The strain SPM21 has the genotype: orf2::att3Qaac-. This strain has a spiramycin 
nonproducer phenotype. However, the orientation of the genes orfl to or/8 (cf. figure 3) 
implies that these genes are cotranscribed. Thus, the phenotype observed may be due to 
a polar effect of the cassette introduced into orf2 on the expression of genes located 
downstream in the operon. The strain SPM22 has the genotype orf2::att3 and was 
obtained after in-phase excision of the cassette introduced. The excision of the cassette 
leaves only a characteristic "scar" sequence (cf. example 10). Since the strain SPM22 
also has a nonproducer phenotype, it may be concluded therefrom that the or/2 gene is a 
gene essential to spiramycin biosynthesis in S. ambofaciens. Only the effect due to the 
inactivation of orfl is observed here. 

The strain SPM501 has the genotype: orf6*::attlf2hyg+. This strain has a 
spiramycin nonproducer phenotype. However, since the orf5* and orf6* genes 
(cf. figure 3) have the same orientation, the phenotype observed may be due to a polar 
effect of the cassette introduced into orf6* on the expression of orf5*. The arrangement 
of these genes implies that they may be cotranscribed. To reply to this question, the 
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strain SPM502 was obtained after in-phase excision of the cassette introduced. In this 
strain, only the effect of the inactivation of or/6* is observed. This strain has the 
genotype orf6*::attl (cf. example 14). Excision of the cassette leaves only an in-phase 
"scar" sequence (cf. example 14). The strain SPM502 has a producer phenotype 
5 (however, this strain produces only spiramycin I (cf*. example 16)). It may therefore be 
concluded that the orf5* gene is a gene essential to spiramycin biosynthesis in 
S. ambofaciens, since indirect inactivation thereof in the strain SPM501 leads to a 
nonproducer phenotype. On the other hand, the or/6* gene is not a gene essential to the 
biosynthesis of spiramycin I in S. ambofaciens (on the other hand, it is essential to the 
10 production of spiramycin II and III (cf. example 16)). 



EXAMPLE 16. Assaying of the production of spiramycins I, II and III in the 
mutant strains obtained 

The various strains to be tested were each cultured in seven 500 ml baffled 
15 Erlenmeyers containing 70 ml of MPS medium (Pernodet et aL y 1993). The 
- Erlenmeyers were inoculated with 2.5 x 10 6 spores/ml of the various strains of 
5. ambofaciens and grown at 27°C with orbital shaking at 250 rpm for 72 hours. The 
cultures corresponding to the same clone were pooled, optionally filtered through a 
pleated filter, and centrifuged for 15 min at 7000 rpm. The various supernatants were 
20 then stored at -30°C 

The assays were carried out by ion -pairing high performance liquid 
chromatography (HPLC). The HPLC analysis of the culture medium makes it possible 
to determine precisely the concentration of the three forms of spiramycin. The column 
used (Macherey-Nagel) is filled with a Nucleosil octyl grafted silica phase. The particle 
25 size is 5 ^im and the pore size is 100 A. The internal diameter of the column is 4.6 mm 
and it is 25 cm long. The mobile phase is a 70/30 (v/v) mixture of H3PO4 buffer (pH 
2.2) and acetonitrile containing 6.25g/L of NaC10 4 perchlorate. The analysis is carried 
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out in an isocratic system with a flow rate fixed at 1. ml/min. The column is 
thermocontrolled at 23°C. Detection is by UV spectrophotometry at 238 nm. The 
sample is refrigerated at +10°C and the quantification is determined from the area of the 
peaks (by external calibration). Under these conditions, the retention times for 
spiramycin I, II and III are, respectively, approximately 17; 21 and 30 minutes, as could 
be verified using a commercial sample comprising the three forms of spiramycin. 

The strain OSC2 has a spiramycin producer phenotype. It is the parental strain 
used to obtain the strains having an excisable cassette (cf. example 1 5). This strain was 
therefore used as a positive control for production of the three forms of spiramycin. 
This strain clearly produces the three forms of spiramycin, as was verified by HPLC (cf. 
figure 19). 

The study of the strains having an excisable cassette makes it possible to draw a 
certain number of conclusions regarding the function of the interrupted genes. The 
strain SPM507 has the genotype: orfl2::att3Qaac-. The phenotype of the strain 
SPM507 is nonproducer (cf. example 15); it may therefore be concluded therefrom that 
the orfl2 gene is a gene essential to spiramycin biosynthesis in S. ambofaciens. This 
strain no longer produces spiramycins, as was verified by HPLC (cf. figure 22). This 
result confirms the essential nature of the orfl2 gene in spiramycin biosynthesis. 

The strain SPM508 has the genotype: orfl3c::att3C2aac-. The strain SPM508 has 
a spiramycin producer phenotype (cf. example 15); it may therefore be concluded 
therefrom that the orfl3c gene is not a gene essential to spiramycin biosynthesis in 
S. ambofaciens. This strain produces spiramycin I, II and III, as was verified by HPLC 
(cf. figure 23). This result confirms that the orfl3c gene is not a gene essential to the 
biosynthesis of spiramycins I, II and III in S. ambofaciens. 

The strain SPM509 has the genotype: orfl4::att3naac-. The phenotype of the 
strain SPM509 is nonproducer; it may therefore be concluded therefrom that the orfl4 
gene is a gene essential to spiramycin biosynthesis in S. ambofaciens. This strain no 
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longer produces spiramycins, as was verified by HPLC (cf. figure 24). This result 
confirms the essential nature of the or/14 gene in spiramycin biosynthesis. 

The strain SPM501 has the genotype: orf6*::att]f2hyg+. This strain has a 
spiramycin nonproducer phenotype. This strain no longer produces spiramycins, as was 
5 verified by HPLC (cf. figure 20). However, since the or/5* and orf6* genes (cf. figure 
3) have the same orientation, the phenotype observed may be due to a polar effect of the 
cassette introduced into orf6* on the expression of orf5* in the operon. This implies 
that these genes are contranscribed. To reply to this question, the strain SPM502 was 
obtained by excision of the cassette introduced, producing an in-phase deletion in the 

10 or 6* gene and restoring the expression of or/5*. This strain has the genotype 
orf6*::attl (cf. example 14 and 15). Excision of the cassette leaves only an in-phase 
"scar" att sequence (cf. example 14). The strain SPM502 has a spiramycin producer 
phenotype. However, as was proved by HPLC, this strain produces only spiramycin I 
and does not produce spiramycin II and III (cf. figure 21). It may therefore be concluded 

15 from these results that the or/5* gene is a gene essential to spiramycin biosynthesis in 
S. ambofacienS) since indirect inactivation thereof in the strain SPM501 leads to a 
spiramycin nonproducer phenotype, (cf. figure 20). On the other hand, the or/6* gene is 
not a gene essential to biosynthesis of spiramycin I in S. ambofaciens, since the 
inactivation of this gene leads to a spiramycin I producer phenotype (cf. figure 21). 

20 However, orf6* is essential to the production of spiramycin II and III (cf. example 16)). 
The or/6* 1 gene therefore clearly encodes an acyl transferase responsible for the 
modification of the platenolide at position 3 (cf. figure 1). 

EXAMPLE 17: Determination of the translation initiation point of orf 10 and 
25 improvement of spiramycin production 



17.1 Construction of the plasmids pSPM523. dSPM524 and oSPM525: 



148 

The orflO gene was identified in Streptomyces ambofaciens and was named 
srmR by Geistlich et al (M. Geistlich et al, 1992). Inactivation of the orflO gene was 
carried out (cf. example 8). It was thus possible to show that the resulting strain no 
longer produces spiramycins (cf example 15). This confirms that the or/70 gene is 
5 clearly involved in spiramycin biosynthesis. The protein encoded by this gene is 
therefore clearly essential to spiramycin biosynthesis. However, analysis of the 
sequence shows that two ATG codons located in the same reading frame may be used 
for the translation of orflO (cf. Figure 28). One of the two possible codons (the most 
upstream codon) begins at position 10656 of the sequence given in SEQ ID No. 1, 
10 whereas the other possible codon, located more downstream, begins at position 10809 
(cf. SEQ ID No. 1). Before testing a possible effect of the overexpression of srmR on 
spiramycin production, it is important to first determine the translation initiation point. 

With the aim of determining the translation iniation site, three constructs 
comprising three forms of orflO were produced. These forms were obtained by PCR 
15 using oligonucleotides comprising either a Hindlll restriction site or a BamUl 
restriction site. 

The first pair used for the amplification corresponds to the following oligonucleotides: 
EDR39: . /*' 

5 'CCCAAGCTTGAGAAGGGAGCGGACATTCATGGCCCGCGCCGAACGC3 ' 
20 (SEQ ID No. 1 22) (the Hindlll site is shown in bold) 
EDR42: 

5 'CGGGATCCGGCTGACCATGGGAGACGGGCGCATCGCCGAGTTCAGC3 ' 
(SEQ ID No. 123) (the BamHl site is shown in bold) 

The pair of primers EDR39-EDR42 allows the amplification of an fragment of orflO 
25 comprising the ATG located in the most 3' position (position 10809 in the sequence 
given in SEQ ID No. 1) (cf. Figure 28). The fragment obtained is approximately 2 kb in 
size and will subsequently be referred to as "short orfl0"\ it does not contain the orflO 
promoter. This 2kb fragment was cloned into the vector pGEM-T easy, to give the 
plasmid pSPM520. 
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The second pair used for the amplification corresponds to the following 

oligonucleotides: 

EDR40: 

5 ' CCC AAGCTTG AG AAGGG AGCGG AC ATTC AATGCTTTGGTAAAGC AC3 ' 
5 (SEQ ID No. 1 24) (the Hindlll site is shown in bold) 
EDR42: 

5 ' CGGG ATCCGGCTG ACC ATGGG AG ACGGGCGC ATCGCCG AGTTC AGC3 ' 
(SEQ ID No. 123) (the BamHl site is shown in bold) 

The pair of primers EDR40-EDR42 allows the amplification of a fragment of orflO 
10 comprising the ATG located in the most 5' position (position 10656 in the sequence 
given in SEQ ID No. 1) (cf. Figure 28). This 2.2 kb fragment, subsequently referred to 
as "long or/10", was cloned into the vector pGEM-T easy, to give the plasmid 
pSPM521 ; this plasmid does not contain the orflO promoter. 

The third pair used for the amplification corresponds to the following oligonucleotides: 
15 EDR41: 

5 '-CCCAAGCTTTCAAGGAACGACGGGGTGGTCAGTC AAGT-3 ' 

(SEQ ID No. 125) (the Hindlll site is shown in bold) 

EDR42: 

5 'CGGGATCCGGCTGACC ATGGGAGACGGGCGCATCGCCGAGTTCAGC3 ' 

20 (SEQ ID No. 123) (the BamHl site is shown in bold) 

The pair of primers EDR41-EDR42 allows the amplification of the orflO gene with the 
two ATGs, and also its own promoter (cf. figure 28). This 2.8 kb fragment, 
subsequently referred to as "pro or/10", was cloned into the vector pGEM-T easy, to 
give the plasmid pSPM522. 

25 The "pro or/10" fragment was obtained using as matrix the chromosomal DNA of the 
strain OSC2. The "short or/10" and "long or/10" fragments were, themselves, obtained 
using as matrix the DNA of the "pro or/10" fragment purified beforehand. 

The Hindlll-BamHl inserts of the plasmids pSPM520, pSPM521 and pSPM522 
were then subcloned into the vector pUWL201 (plasmid derived from the plasmid 
30 pUWL199 (U.F. Wehmeier, 1995) in to which the Kpnl-BamHl fragment of the region 
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of the ermE promoter (cf. Bibb et aL 9 1985, in particular figure 2) carrying a mutation 
which increases the strength of the promoter (ermE* promoter) (Bibb et al, 1994) has 
been introduced (cf. Doumith et aL, 2000)) predigested with the Hindlll-BamHl 
enzymes. Thus, three plasmids were obtained: pSPM523 (derived from pUWL201 with 
5 the "short or/10" form as insert), pSPM524 (derived from pUWL201 with the "long 
or/10" form as insert) and pSPM525 (derived from pUWL201 with the "pro or/10" 
form) (Figure 28). 

17.2 Transformation of the strain OS49.50 with the constructs pSPM523. 
PSPM524 and pSPMS25: 

10 The strain OS49.50 (strain with a knockout in the or/10 gene, cf. example 8) 

was transformed independently by protoplast transformation (T. Kieser et aL, 2000) 
with each of the plasmids pSPM523, pSPM524 and pSPM525. A negative control was 
also prepared by transforming the strain OS49.50 with the plasmid pUWL201 without 
insert. After protoplast transformation, the clones were selected for their thiostrepton 

15 resistance. The transformation of the clones with each of the plasmids was verified by 
extraction of these plasmids. Thus, four new strains were obtained: the strain OSC49.50 
pUWL201, derived from the transformation with the plasmid pUWL201 without insert; 
the strain OSC49.50 pSPM523, derived from the transformation with the plasmid 
pSPM523; the strain OSC49.50 pSPM524, derived from the transformation with the 

20 plasmid pSPM524; and, finally, the strain OS49.50 pSPM525, derived from the 
transformation with the plasmid pSPM525. 

The spiramycin production of each of these four strains was tested by HPLC. 
For this, the various strains of Streptomyces to be tested were cultured in 500 ml 
Erlenmeyers with baffles (baffled Erlenmeyers) containing 70 ml of MP5 medium 
25 (Pernodet et aL, 1993). When the strain contains the plasmid pUWL201, or one of its 
derivatives, 5 |ig/ml of thiostrepton are added. The baffled Erlenmeyers were inoculated 
at an initial concentration of 2.5 x 10 6 spores/ml of the various strains of S. 
ambofaciens, and the cultures were incubated at 27°C with orbital shaking at 250 rpm 
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for 96 hours. The cells were then separated from the medium by centrifugation and the 
supernatant was analyzed by HPLC (cf. example 16) in order to determine the amount 
of spiramycin produced. By virtue of a standard sample and measuring the area of the 
peaks, it was possible to determine the amount of each of the spiramycins produced by 
these strains. The results of this analysis are given in table 39, the data are expressed in 
mg per liter of supernatant. The results correspond to the total production of 
spiramycins (obtained by adding the production of spiramycin I, II and III). 

Table 39. Spiramycin production of the strains derived from OS49.50, (results 
expressed in mg/1). 



Strain 


Spiramycin production 


OS49.50 
pUWL201 


0 


OS49.50 
pSPM523 


0 


OS49.50 
pSPM524 


93 


OS49.50 
pSPM525 
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As shown by the results given in table 39, the negative control (strain OS49.50 
transformed with the plasmid pUWL201) does not produce spiramycin. When the 
plasmid pSPM523 (which contains the "short or/70" form) is introduced into the strain 
OS49.50, no spiramycin production is observed. On the other hand, the presence of the 
plasmid pSPM524 (which contains the "long or/10" form) and of the plasmid 
pSPM525 (which contains the "pro or/10" form) restores the spiramycin production in 
the host strain OS49.50. Thus, only the or/10 fragments containing the most upstream 
ATG make it possible to restore spiramycin synthesis. 
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With the aim of confirming these results, the plasmid pSPM521 (plasmid 
pGEM-T easy containing the "long or/70" form) was digested with the Xhol restriction 
enzyme (this enzyme has a unique site in this plasmid, located between the two ATGs 
(cf. Figure 28)). The Xhol ends were then blunt-ended by treatment with the Klenow 
5 enzyme. The plasmid was then closed back up on itself by the action of T4 DNA ligase, 
to give the plasmid pSPM527. If the most upstream ATG (position 10656 in the 
sequence given in SEQ ID No. 1) is indeed used as translation initiation site, this 
treatment will lead to a shift in the reading frame at the Xhol site and will have the 
effect of producing a protein exhibiting no activating acivity. On the other hand, if the 
10 translation initiation takes place at the most downstream ATG (position 10809 in the 
sequence given in SEQ ID No. 1), this treatment should have little or no effect on the 
expression of OrflO (given the location of the transcription initiation point) and no 
effect on the protein produced. 

The BamHl-Hindlll insert of pSPM527 was then subcloned into the vector 
15 pUWL201, to give the plasmid pSPM528. This plasmid was introduced into the strain 
OS49.50 and a clone having the desired plasmid was more particularly selected. The 
spiramycin production of the resulting strain was then tested by HPLC (cf. example 1 6 
and above). Unlike what was observed with the plasmid pSPM524 (cf. table 39), the 
presence of the plasmid pSPM528 in the strain OS49.50 does not reestablish the 
20 spiramycin production. This confirms that the translation initiation of the orflO gene is 
the ATG located in the most downstream position (ATG 1 cf. Figure 28). 

17.3 Improvement in spiramycin production of the S. ambofaciens strain 

OSC2: 

In order to test the effect of overexpression of the orflO gene on spiramycin 
25 production, the plasmids pSPM523, pSPM524, pSPM525 and pSPM528 were 
introduced into the strain OSC2. For this, protoplasts of the strain OSC2 were 
transformed (T. Kieser et aL, 2000) independently with each of the plasmids pSPM523, 
pSPM524, pSPM525 and pSPM528. A negative control was also produced by 
transforming the strain OSC2 with the plasmid pUWL201 without insert. After 
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protoplast transformation, the clones were selected for their thiostrepton resistance. 
Thus, five new strains were obtained: the strain OSC2 pUWL201, derived from the 
transformation with the plasmid pUWL201 without insert, the strain OSC2 pSPM523, 
derived from the transformation with the plasmid pSPM523; the strain OSC2 
5 pSPM524, derived from the transformation with the plasmid pSPM524; the strain 
OSC2 pSPM525, derived from the transformation with the plasmid pSPM525; and, 
finally, the strain OSC2 pSPM528, derived from the transformation with the plasmid 
pSPM528. The spiramycin production of these strains was then analyzed by HPLC (in 
the same way as in example 17.2). Analysis of the spiramycin production of the strain 
10 OSC2 was also carried out in parallel for comparison. The results of this analysis are 
given in table 40, the data are expressed in mg per liter of supernatant. The results 
correspond to the total production of spiramycins (obtained by adding the production of 
spiramycin I, II and III). 



Table 40. Spiramycin production of the strains derived from OSC2, (results 
expressed in mg/1). 



Strain 


Spiramycin production 


OSC2 


69 


OSC2 
pUWL201 


103 


OSC2 
pSPM523 


19 


OSC2 
pSPM524 


135 
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OSC2 
pSPM525 


278 


OSC2 
pSPM528 


68 



Thus, it is observed that the presence of the plasmid pSPM524 or of the plasmid 
pSPM525 significantly increases the spiramycin production of the strain OSC2. This 
clearly demonstrates that overexpression of OrflO has a positive effect on the 
spiramycin production. On the other hand, the plasmid pSPM528 has no effect on the 
spiramycin production. 

In the same way, the plasmids pSPM525 and pUWL201 were introduced into 
the strain SPM502 (cf. example 14). Thus, two new strains were obtained: the strain 
SPM502 pUWL201, derived from the transformation with the plasmid pUWL201 
without insert; and the strain SPM502 pSPM525, derived from the transformation with 
the plasmid pSPM525. 

A sample of the strain SPM502 pSPM525 (this strain contains the plasmid 
pSPM525, cf. above) was deposited with the Collection Nationale de Cultures de 
Microorganismes (CNCM) Pasteur Institute, 25, rue du Docteur Roux 75724 Paris 
Cedex 15, France, on February 26, 2003 under the registration number 1-2977. 

The spiramycin production of the strains SPM502 pUWL201 and SPM502 
pSPM525 was analyzed by HPLC (in the same way as in example 17.2). Analysis of the 
spiramycin production of the strain SPM502 was also carried out in parallel for 
comparison. The results of this analysis are given in table 41, the data are expressed in 
mg per liter of supernatant. The results correspond to the production of spiramycin I. In 
fact, none of these strains produces spiramycin II and III. 

Table 41. Spiramycin I production of the strains derived from SPM502, (results 
expressed in mg/1). 
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Strain 


Spiramycin I 


SPM502 


47 


SPM502 pUWL201 


72 


SPM502 pSPM525 


130 



Thus, it was possible to observe that overexpression of the orflO gene in the 
strain SPM502 considerably increases the production of spiramycin I. 

EXAMPLE 18: Construction of a genomic DNA library of the Streptomyces 
5 ambofaciens strain OSC2 in E. coli in the cos mid pWED2. 

1 8.1 Construction of the cosmid pWED2: 

With the aim of facilitating the inactivation of genes in Streptomyces, a cosmid 
carrying the oriT sequence of the plasmid RK2 (which allows its introduction by 
conjugation into Streptomyces from a suitable strain of E. coli) and also carrying a gene 

10 for resistance to a an antibiotic imparting a detectable phenotype in Streptomyces was 
constructed. Such a cosmid containing large inserts of genomic DNA of Streptomyces 
ambofaciens can be used in gene inactivation experiments. 

To construct this vector, a pac-oriT cassette (EcoRV fragment) was introduced 
into the cosmid pWEDl (Gourmelen et al. 9 1998), derived from the cosmid pWED15 

15 (Wahl et al., 1987), at the unique Hpa\ site. The pac-oriT cassette was obtained by 
PCR. For this, the pac gene was amplified by PCR from the plasmid pVF 10.4 (Vara et 
aL, 1985; Lacalle et al, 1989) using, as first primer, primer A (of sequence 5'- 
CC AGTAGATATCCCGCCAACCCGGAGCTGC AC-3 ' (SEQ ID No. 126), the 
EcoRV restriction site has been underlined and the 20 nucleotide sequence in bold 
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corresponds to a region upstream of the promoter of the pac gene) and, as second 
primer, primer B (of sequence 

S^ GAAAAGATCCGT CATGGGGTrnTnrnTTrrTT-V (SEQ ID No. 127), 
which comprises, at its 5' end, a 12 nucleotide sequence corresponding to the start of 
5 the oriT sequence (double underlined) and a 20 nucleotides sequence (in bold) 
corresponding to the end of the pac gene (cf. figure 29, 1st PCR). 

As regards the oriT gene, it was amplified by PCR from the plasmid pPM803 
(P. Mazodier et aL 9 1989) using, as first primer, primer C (of sequence 
5 '-CACGACCCCATGACGGATCTTTTCCGCTGCAT-3 ' (SEQ ID No. 1 28)), which 

10 comprises, at its 5' end, a 12 nucleotide sequence corresponding to a sequence 
downstream of the coding sequence of the pac gene (in bold) and a 20 nucleotide 
sequence corresponding to the start of the oriT sequence and, as second primer, primer 
D (of sequence 5 '-GAGCCG GATATCATCGGTCTTGCCTTGCTCGT -3 ' (SEQ ID 
No. 129)), which comprises the EcoKV restriction site (single underlined) and a 20 

15 nucleotide sequence corresponding to the end of the oriT sequence (double underlined) 
(cf. Figure 29, 2nd PCR). 

The amplification product obtained with the primers A and B and that obtained 
with primers C and D have, at one; of their ends, a common sequence of 24 nucleotides. 
A third PCR was carried out by mixing the two amplification products previously 

20 obtained and using, as primers, primers A and D (cf. figure 29, 3rd PCR). This made it 
possible to obtain an amplification product corresponding to the combination pac+oriT. 
This pac-oriT fragment was then cloned into the vector pGEM-T Easy (marketed by the 
company Promega (Madison, Wisconsin, USA)), which made it possible to obtain the 
plasmid pGEM-T -pac-oriT. The insert of this plasmid was then subcloned into the 

25 cosmid pWEDl. For this, the plasmid pGEM-pac-oriT was digested with the EcoKV 
enzyme and the EcoKV insert containing the combination pac+oriT was inserted into 
the cosmid pWEDl opened beforehand with the Hpa\ enzyme. The cosmid thus 
obtained was named pWED2 (cf. figure 30). 

This cosmid makes it possible to facilitate the inactivation of genes in 

30 Streptomyces. Specifically, it carries the oriT sequence, which allows its introduction by 
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conjugation into Streptomyces from a suitable strain of E. coli, but also a gene for 
resistance to an antibiotic imparting a detectable phenotype in Streptomyces. Such a 
cosmid containing large inserts of genomic DNA of Streptomyces ambofaciens can be 
used in gene inactivation experiments. 
5 Thus, a cosmid derived from pWED2 containing the target gene may, for 

example, be introduced into the E. coli strain KS272 containing the plasmid pKOBEG 
(cf. Chaveroche et ah 2000) and a cassette will be introduced into the target gene 
according to the technique described by Chaveroche et ah 2000. The cosmid obtained 
by this technique (cosmid in which the target gene is inactivated) may then be 
10 introduced into an E. coli strain such as the SI 7.1 strain or any other strain making it 
possible to transfer plasmids containing the oriT sequence to Streptomyces by 
conjugation. 

After conjugation between E. coli and Streptomyces, clones of Streptomyces in 
which the wild-type copy of the target gene will have been replaced with the interrupted 
15 copy may be obtained as described in example 2. 

The resistance gene expressed in Streptomyces, present on this new cosmid, is - 
the/Kzc gene of Streptomyces alboniger (J. Vara et ah, 1985; Lacalle et ah, 1989), 
which encodes puromycin N-acetyltransferase and which imparts puromycin resistance. 
In gene inactivation experiments, the clones in which a double recombination event has 
20 taken place will be sought. These clones will have eliminated the cosmid pWED2 and 
will therefore have become sensitive to puromycin again. 

18.2 C onstruction of a genomic DNA library of the Streptomyces ambofaciens 
strain OSC2 in E, coli in the cosmid pWED2 

The genomic DNA of the Streptomyces ambofaciens strain OSC2 was partially 
25 digested with the BamHI restriction enzyme so as to obtain DNA fragments of between 
approximately 35 and 45 kb in size. These fragments were then cloned into the cosmid 
pWED2, the latter having been digested beforehand with BamHI, and then treated with 
alkaline phosphatase. The ligation mixture was then encapsidated in vitro in lambda 
phage particles using the "Packagene® Lambda DNA packaging system" marketed by 
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the company Promega (Madison, Wisconsin, USA) according to the manufacturer's 
recommendations. The phage particles obtained were used to infect the £*. coli strain 
SURE® marketed by the company Stratagene (LaJolla, California, USA). The clones 
were selected on LB medium + ampicillin (50 ng/ml), the cosmid pWED2 imparting 
ampicillin resistance. 

EXAMPLE 19: Isolation of cosmids of the new library covering the region of the 
biosynthetic pathway for spiramycins. Subcloning and sequencing of fragments of 
these cosmids. 

19.1 Hy bridization on colonies of the genomic library of Streptomvces 
ambofaciens OSC2: 

Cosmids of the new library of Streptomyces ambofaciens OSC2 (cf. example 
1 8) covering orfl * to or/10* or a part or all of orfl to orf25c 9 or a region more upstream 
of orf25c, were isolated. For this, successive hybridizations on the colonies of E. coli 
obtained according to example 18 were performed using the following 3 probes 
(cf. figure 31): ; 

- The first probe used corresponds to a DNA fragment of approximately 0.8 kb 
amplified by PCR using as matrix the cosmid pSPM5, and the following primers: 
ORF23c: 5'-ACGTGCGCGGTGAGTTCGCCGTTGC-3 ' (SEQ ID No. 130) and 
ORF25c: 5 ' -CTG AACG ACGCC ATCGCGGTGGTGC-3 ' (SEQ ID No. 131). 

The PCR product thus obtained contains a fragment of the start of orf23c f orf24c in its 
entirety and the end of orf25c (cf. figure 3 1 , probe I). 

- The second probe used corresponds to a DNA fragment of approximately 0.7 kb 
amplified by PCR using as matrix the total DNA of the S. ambofaciens strain 
ATCC23877, and the following primers: 
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ORFl*c: 5'-GACCACCTCGAACCGTCCGGCGTCA-3' (SEQ ID No. 132) and 
ORF2*c: 5'-GGCCCGGTCCAGCGTGCCGAAGC-3' (SEQ ID No. 133). 
The PCR product thus obtained contains a fragment of the end of orfl*c and of the start 
of orf2*c (cf. figure 3 1 , probe II). 

- The third probe used corresponds to an EcoRl-BamHl fragment of approximately 3 kb 
containing orfl, orf2 and or/3 and obtained by digestion of the plasmid pOS49.99 
(cf. figure 31, probe III). 

Approximately 2000 clones of the library obtained in example 18.2 were 
transferred onto a filter for colony hybridization according to conventional techniques 
(Sambrook et ai, 1989). 

The first probe (cf. figure 31, probe I) was labeled with 32 P by the random 
priming technique (kit marketed by the company Roche) and used for the hybridization 
on 2000 clones of the library, after transfer onto a filter. The hybridization was carried 
out at 65°C in the buffer described by Church & Gilbert (Church & Gilbert, 1984). Two 
washes were carried out in 2 x SSC, 0,1% SDS at 65°C, the first for 10 minutes and the 
second for 20 minutes, and a third wash lasting 30 minutes was then carried out in 0.2X 
SSC, 0.1% SDS at 65°C. Under these hybridization and washing conditions, 20 clones 
out of the 2000 hybridized exhibited a strong hybridization signal with the first probe. 
These 20 clones were cultured in LB medium + ampicillin (50 ug/ml) and the 
corresponding 20 cosmids were extracted by alkaline lysis (Sambrook et ai, 1989) and 
then digested with the BamRl restriction enzyme. The digestion products were then 
separated on agarose gel, transferred onto a nylon membrane and hybridized with the 
first probe (cf. above: the PCR product ORF23c-ORF25c, probe I) under the same 
conditions as above. Twelve cosmids contained a BamHl fragment which hybridized 
strongly with the probe used. These 12 cosmids were named pSPM34, pSPM35, 
PSPM36, pSPM37, pSPM38, pSPM39, pSPM40, pSPM41, pSPM42, pSPM43, 
pSPM44 and pSPM45. The profiles, after digestion with BamHI, of these 12 cosmids 
were compared with one another and with that of the cosmid pSPM5. In addition, PCR 
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amplification experiments using various primers corresponding to various genes already 
identified in the orfl-or/25c region made it possible to position the insert of some of 
these cosmids with respect to one another and to also determine the location of these 
inserts in the already known orfl-orf25c region (cf. figure 32). The cosmid pSPM36 
was more particularly chosen since it was liable to contain a large region upstream of 
orf25c (cf. figures 31 and 32). 

Next, using the same conditions as those described above, the 2000 clones of 
the library of Streptomyces ambofaciens OSC2 were hybridized with the second probe 
corresponding to the PCR product: ORFl*c-ORF2*c (cf. figure 31, probe II). This 
hybridization made it possible to isolate cosmids whose insert is located in the region 
from orfl*c to orfl0*c. Under the hybridization conditions used, 16 clones out of the 
2000 hybridized exhibited a strong hybridization signal with this second probe. These 
16 clones were cultured in LB medium + ampicillin (50 ^ig/ml) and the corresponding 
16 cosmids were extracted by alkaline lysis (Sambrook et al. 9 1989) and then digested 
with the BamHl restriction enzyme. The digestion profiles (after digestion with BamHl) 
of these 16 cosmids were compared with one another and with that of the cosmid 
pSPM7. PCR amplification experiments with the primers ORFl*c and ORF2*c made it 
possible to choose two cosmids which clearly contained the orfl*c and orf2*c genes 
and the profiles of which had common bands but also different bands. In addition, other 
PCR amplification experiments using various primers corresponding to various genes 
already identified in the region orfl*c to orfl0*c made it possible to position the insert 
of these cosmids with respect to one another and to also determine the location of these 
inserts in the already known region orfl*c to orfl0*c (cf. figure 32). The two cosmids 
more particularly selected were named pSPM47 and pSPM48 (cf. Figure 32). 

Using the same conditions as those described above, the 2000 clones of the 
library of Streptomyces ambofaciens OSC2 were also hybridized with the third probe 
corresponding to the EcoRl-BamHl DNA fragment of the plasmid pOS49.99 
(cf. figure 31 probe III). This hybridization made it possible to isolate the cosmids 
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containing the region from orfl up to or/3 and liable to contain either a large part of the 
PKS genes or a large part of the genes orfl to or/25c of the biosynthetic pathway for 
spiramycins. Under these hybridization conditions, 35 clones out of the 2000 hybridized 
exhibited a strong hybridization signal with the third probe. These 35 clones were 
5 cultured in LB medium + ampicillin (50 jag/ml) and the corresponding 35 cosmids were 
extracted by alkaline lysis (Sambrook et aL, 1989) then digested with the BamHl 
restriction enzyme. The profiles, after digestion with BamHl, of these 35 cosmids were 
compared with one another and with that of the cosmid pSPM5. In addition, PCR 
amplification experiments using various primers corresponding to various genes already 

10 identified in the region orfl to orf25c made it possible to verify that these cosmids 
clearly contained inserts originating from the region orfl to orf25c and to position the 
inserts of these cosmids with respect to one another and with respect to the already 
known region orfl to orf25c (cf. figure 32). Five cosmids were more particularly 
selected, they were named pSPM50, pSPM51, pSPM53, pSPM55 and pSPM56 

15 (cf. figure 32). 



19.2 Subcloning and sequencing of a part of the insert of the cosmid pSPM36 

The probe of approximately 0.8 kb obtained by PCR with the primers ORF23c 
and ORF25c (cf. above and figure 31, probe I) was also used in Southern blotting 
experiments on the total DNA of S. ambofaciens OSC2 digested with the Pstl enzyme. 

20 Under the hybridization conditions described above, this probe reveals a single Pstl 
fragment of approximately 6 kb when hybridized on the total DNA of S. ambofaciens 
OSC2 digested with the Pstl enzyme. A Pstl site exists in the region of orf23c (cf. SEQ 
ID No. 80), but no other Pstl site exists up to the end (BamHl site) of the known 
sequence (cf. SEQ ID No. 1). This Pstl-BamHl fragment is approximately 1.4 kb in 

25 size. The 6 kb Pstl fragment hybridized on the total DNA of S. ambofaciens digested 
with the Pstl enzyme therefore contains a region of approximately 4.6 kb located 
upstream of orf25c. This region is liable to contain other genes the products of which 
are involved in the biosynthetic pathway for spiramycin. It was verified, by digestion, 
that the cosmid pSPM36 indeed contained this 6 kb Pstl fragment. This fragment was 
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isolated from pSPM36, with the aim of determining the sequence further upstream of 
or/25c. For this, the cosmid pSPM36 was digested with the Pstl restriction enzyme. The 
Pstl-Pstl fragment, approximately 6 kb in size, was isolated by electroelution from a 
0.8% agarose gel and then cloned into the vector pBK-CMV (4512 bp) (marketed by 
5 the company Stratagene (La Jolla, California, USA)). The plasmid thus obtained was 
named pSPM58 (cf. figure 33) and the sequence of its insert was determined. The 
sequence of this insert is given in SEQ ID No. 134. However, not all the sequence was 
determined and a gap of approximately 450 nucleotides remains, the part of the 
sequence undetermined was noted by a succession of "N"s in the corresponding 
10 sequence. 

19.3 A nalysis of the new nucleotide sequences, determination of the open reading 
frames and characterization of the genes involved in spiramycin biosynthesis 

The sequence of the insert of the cosmid pSPM58 obtained was analyzed using 
the FramePlot program (J. Ishikawa & K. Hotta 1999). This made it possible to identify, 

15 among the open reading frames, the open reading frames exhibiting a codon use typical 
of Streptomyces. This analysis made it possible to determine that this insert comprises 4 
new ORFs upstream of orf25c (cf. figure 34). These genes were named orf26 (SEQ ID 
No. 107), or/27 (SEQ ID No. 109), orf28c (SEQ ID No. Ill, the sequence of this orf 
was not completely determined since a gap of approximately 450 nucleotides remains in 

20 the sequencing of the insert of pSPM58, these 450 nucleotides appearing in the form of 
a series of "N"s in the sequence SEQ ID No. 1 1 1) and or/29 (the sequence of the latter 
orf was incomplete in this insert). The "c" added to the name of the gene means, for the 
ORF in question, that the coding sequence is in the reverse orientation (cf. figure 34). 

The protein sequences deduced from these open reading frames were compared 
25 with those present in various databases using various programs: BLAST (Altschul et 
al. 9 1990) (Altschul et al. 9 1997), CD-search, (these two programs are accessible in 
particular from the National Center for Biotechnology Information (NCBI) (Bethesda, 
Maryland, USA)), FASTA ((W.R. Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 
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1990) (this program is accessible in particular from the INFOBIOGEN resource center, 
Evry, France). These comparisons made it possible to formulate hypotheses regarding 
the function of the products of these genes and to identify those liable to be involved in 
spiramycin biosynthesis. 

19.4 Subcloning and sequencing of another part of the insert of the cosmiH 
PSPM36. 

The probe of approximately 0.8 kb obtained by PCR with the primers ORF23c 
and ORF25c (cf. above and figure 31, probe I) was also used in Southern blotting 
experiments on the total DNA of the strain OSC2 digested with the Stul enzyme. Under 
the hybridization conditions described above for this probe, this probe reveals a single 
Stul fragment of approximately 10 kb when hybridized on the total DNA of the strain 
OSC2 digested with the Stul enzyme. Given the presence of an Stul site in orf23c 
(cf. SEQ ID No. 80) and the location of this site relative to the Pstl site, this 10 kb 
fragment includes all of the Pstl fragment previously studied (insert of pSPM58) and 
makes it possible to have access to an approximately 4 kb region not yet studied 
(cf. figure 33). It was verified, by digestion, that the cosmid pSPM36 indeed contained 
this 1 0 kb Stul fragment. This fragment was isolated from the cosmid pSPM36, with the 
aim of determining the sequence of the end of or/29 and of other genes further upstream 
of or/29. For this, the cosmid pSPM36 was digested with the Stul restriction enzyme. 
The Stul-Stul fragment, approximately 10 kb in size, was isolated by electroelution 
from a 0.8% agarose gel and then cloned into the vector pBK-CMV (4512 bp) 
(marketed by the company Stratagene (La Jolla, California, USA)). The plasmid thus 
obtained was named pSPM72 (cf. figure 33). The latter was then digested with EcoRl 
(EcoRl site in the insert of pSPM58) and HindlU (Hindlll site in the multiple cloning 
site of the vector, immediately after the Stul site of end of the insert) (cf. figure 33). The 
EcoRl-Hindlll DNA fragment thus obtained corresponds to a fragment of the insert of 
the plasmid pSPM72 (cf. figure 33) and was subcloned into the vector pBC-SK+ 
(marketed by the company Stratagene (La Jolla, California, USA)) digested beforehand 
with EcoRl and Hindlll. The plasmid thus obtained was named pSPM73 and the 
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sequence of its insert was determined. The sequence of this insert is given in SEQ ID 
No. 135. 

An assembly of the sequences of the inserts of pSPM58 and pSPM73 is given in SEQ 
ID No. 106. This sequence starts from the Pstl site in or/23c (cf. SEQ ID No. 80) and 
continues to the Stul site in orf32c (cf. figure 34). Since the sequence of orf28c (SEQ 
ID No. Ill) is not complete (cf. example 19.3), a region of approximately 450 
nucleotides is not sequenced, these 450 nucleotides appear in the form of a series of 
"N"s in the sequence SEQ ID No. 106. 

19.5 Analysis of the new n ucleotide sequences, determination of the op en reading 
frames and characterization of t he genes involved in spiramycin biosynthesis 

The partial sequence of the insert of the cosmid pSPM73 obtained was analyzed 
using the FramePlot program (J. Ishikawa & K. Hotta, 1999). This made it possible to 
identify, among the open reading frames, the open reading frames exhibiting a codon 
use typical of Streptomyces. This analysis made it possible to determine that this insert 
comprises 4 ORFs, one incomplete and three complete (cf. figure 34). The incomplete 
ORF corresponds to the 3' portion of the coding sequence of or/29, which made it 
possible to complete the sequence of this gene by virtue of the partial sequence of this 
same orf obtained during the sequencing of the insert of the plasmid pSPM58 
(cf. examples 19.2 and 19.3); the combination of the two sequences thus made it 
possible to obtain the complete sequence of or/29. The 4 genes were thus named or/29 
(SEQ ID No. 113), or/30c (SEQ ID No. 115). or/31 (SEQ ID No. 1 18) and orf32c (SEQ 
ID No. 120). The "c" added to the name of the gene means, for the ORF in question, 
that the coding sequence is in the reverse orientation (cf. figure 34). 

The protein sequences deduced from these open reading frames (SEQ ID No. 1 14 
for or/29, SEQ ID No. 1 16 and 1 17 for or/30c, SEQ ID No. 1 19 for or/31 and SEQ ID 
No. 121 for or/32c) were compared with those present in various databases using 
various programs: BLAST (Altschul et al., 1990) (Altschul et al., 1997), CD-search, 
(these two programs are accessible in particular from the National Center for 
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Biotechnology Information (NCBI) (Bethesda, Maryland, USA)), FASTA ((W.R. 
Pearson & D.J. Lipman, 1988) and (W.R. Pearson, 1990) (this program is accessible in 
particular from the INFOBIOGEN resource center, Evry, France). These comparisons 
made it possible to formulate hypotheses regarding the function of the products of these 
5 genes and to identify those liable to be involved in spiramycin biosynthesis. 

19.6 Subcloning and sequencing of a third portion of the insert of the cosmid 
PSPM36. 

A probe (0.8 kb DNA fragment) corresponding to a sequence internal to or/3 2c 
was obtained by PCR using, as matrix, the total DNA of the Streptomyces ambofaciens 
10 strain and the following primers 

KF36: 5'- TTGCCGTAGCCGAGGACCAGCG-3' (SEQ ID No. 151) and 
KF37: 5*- CACATGGCCCTGGAGGACCCTG-3 1 (SEQ ID No. 1 52). 

The PCR product thus obtained represents an internal sequence of or/32c. This 
probe was used in Southern blotting experiments on the chromosomal DNA of the 

15 strain OSC2 and on the DNA of the cosmid pSPM36, digested with the Pstl enzyme. 
Using the same hybridization conditions as those described above (cf. example 19.1), 
this probe reveals two Pstl fragments of approximately 3.4 kb and 2.5 kb when 
hybridized on the total DNA of the strain OSC2 and on the DNA of the cosmid 
pSPM36, digested with the Pstl enzyme. Given the presence of a Pstl site in the probe 

20 used, these results can be explained. The first DNA fragment, which has a size of 
approximately 3.4 kb, is a fragment the sequence of which is already entirely known. 
The sequence of the 2.5 kb fragment is only partially known, over a region of 700 bp. 
This fragment was isolated from the cosmid pSPM36 with the aim of determining the 
sequence of the end of or/32c and other genes upstream of the latter. For this, the 

25 cosmid pSPM36 was digested with the Pstl restriction enzyme. The Pstl-Pstl fragment, 
approximately 2.5 kb in size, was isolated by purification from a 0.8% agarose gel and 
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then cloned into the vector pBK-CMV (4518bp) (marketed by the company Stratagene 
(La Jolla 5 California, USA)). The plasmid thus obtained was called pSPM79 (cf. figure 
41) and the sequence of its insert was determined. 

The sequence of orf28c (SEQ ID No. Ill) was not complete (cf. example 19.3). 
5 Specifically, it had not been possible to determine a region of approximately 450 
nucleotides, these 450 nucleotides appear in the form of a series of "N"s in the sequence 
SEQ ID No. 106. The sequence of the entire missing region was determined by 
resequencing this region. The sequence of the inserts of pSPM58 and pSPM73 was 
therefore determined in its entirety. The complete sequence of the coding portion of 
10 orf28c is given in SEQ ID No. 141 and the protein deduced from this sequence in SEQ 
ID No. 142. The sequence of the insert of the plasmid pSPM79 is given in SEQ ID 
No. 161. 

An assembly of the sequences of the inserts of pSPM58, pSPM73 and pSPM79 is 
given in SEQ ID No. 140 (cf. figure 41). This sequence starts from the Pstl site in 
15 orf23c (cf. SEQ ID No. 80) and continues up to the Pstl site in orf34c (cf. figure 41). 

19.7 Analysis of the new nucleotide sequences, determination of the open reading 
frames and characterization of the genes which may be involved in spiramycin 
biosynthesis. 

The sequence of the insert of the plasmid pSPM79 obtained was analyzed using 
20 the FramePlot program (Ishikawa J & Hotta K. 1999). This made it possible to identify, 
among the open reading frames, the open reading frames exhibiting a codon usage 
typical of Streptomyces. This analysis made it possible to determine that this insert 
contains 3 ORFs, two incomplete (or/32c and orf34c) and one complete (or/33) 
(cf. figure 41). 

25 The first incomplete ORF corresponds to the 5' portion of the coding sequence of 

orJ32c. This made it possible to complete the sequence of this gene by virtue of the 
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partial sequence of this same orf obtained during the sequencing of the insert of the 
plasmid pSPM73 (cf. examples 19.4 and 19.5), the combination of the two sequences 
thus made it possible to obtain the complete sequence of or/3 2c. (SEQ ID No. 145). 
The "c" added to the name of the gene means, for the ORF in question, that the coding 
sequence is in the reverse orientation (cf. figure 41). The complete orf was called or/33 
(SEQ ID No. 147). The third ORF was called or/3 4c (SEQ ID No. 149). The "c" added 
to the name of the gene means, for the ORF in question, that the coding sequence is in 
the reverse orientation (cf. figure 37). The comparisons performed between the product 
of this orf and the databanks suggest that the C-terminal portion of this protein is not in 
the product deduced from the nucleotide sequence and therefore that this orf is longer 
and continues beyond the region sequenced. 

The protein sequences deduced from these open reading frames were compared 
with those present in various databases using various programs: BLAST (Altschul et al., 
1990) (Altschul et al., 1997), CD-search (these two programs are accessible in 
particular from the National Center for Biotechnology Information (NCBI) (Bethesda, 
Maryland, USA)), FASTA ((Pearson W. R. & D. J. Lipman, 1988) and (Pearson W. R, 
1990) (this program is accessible in particular from the INFOBIOGEN resource center, 
Evry, France). These comparisons made it possible to formulate hypotheses regarding 
the function of the products of these genes and to identify those which may be involved 
in spiramycin biosynthesis. 

EXAMPLE 20: Analysis of the production of spiramycin biosynthesis 
intermediates: 

20.1 Sample preparation: 

The various strains to be tested were each cultured in seven 500 ml baffled 
Erlenmeyers containing 70 ml of MP5 medium (Pernodet et al., 1993). The 
Erlenmeyers were inoculated with 2.5 x 10 6 spores/ml of the various strains of 
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S. ambofaciens and grown at 27°C with orbital shaking at 250 rpm for 96 hours. The 
cultures corresponding to the same clone were pooled, optionally filtered through a 
pleated filter, and centrifiiged for 15 min at 7000 rpm. 

The pH of the must was then adjusted to 9 with sodium hydroxide and the 
supernatant was extracted with methyl isobutyryl ketone (MIBK). The organic phase 
(MIBK) was then recovered and evaporated. The dry extract was then taken up in 1ml 
of acetonitrile, then diluted to 1/10 (100 pi qs 1 ml with water) before being used for 
the liquid chromatography/mass spectrometry (LC/MS) analyses. 

20.2 Analysis of the samples bv LC/MS: 

The samples were analyzed by LC/MS with the aim of determining the mass of 
the various products synthesized by the strains to be tested. 

The high performance liquid chromatography column used is a Kromasil C8 150* 
4.6mmm, 5mmm, 100 A column. 

The mobile phase is a gradient consisting of a mixture of acetonitrile and an aqueous 
0.05% trifluoroacetic acid solution, the flow rate is fixed at 1 ml/min. The temperature 
of the column oven is maintained at 30°C. 

The UV detection at the column outlet was carried out at two different wavelengths: 
238 nm and 280 nm. 

The mass spectrometer coupled to the chromatography column is a Single Quadripole 
device marketed by the company Agilent, with cone voltages at 30 and at 70 V. 

20.3 Analysis of the biosyn thesis intermediates produced by the strain OS4Q 67- 

The strain OS49.67 in which the or/3 gene is inactivated by an in-phase deletion 
does not produce spiramycins (cf. examples 6 and 15). 

A sample was prepared according to the method described above (cf. paragraph 20.1) 
and was analyzed by LC/MS as described above (cf. paragraph 20.2). 
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More particularly, the analysis by chromatography was carried out in a solvent gradient 
with, as mobile phase: 20% of acetonitrile from time T=0 to 5min, then linear increase 
to 30% at T=35 minutes, followed by a plateau up to T=50 minutes. 

Under these conditions, two products were more particularly observed: an 
absorbent at 238 nm (retention time of 33.4 min) and an absorbent at 280 nm (retention 
time of 44.8 min) (cf. figure 35). Figure 35 shows the superposition of the HPLC 
chromatograms produced at 238 and 280 nm (top) and also the UV spectra of the 
molecules eluted at 33.4 minutes and 44.8 minutes (bottom). 

The coupled mass spectrometry analysis conditions were as follows: the scanning 
is carried out in scan mode, covering a mass range of between 100 and 1000 Da. The 
gain of the electromultiplier was 1 V. As regards the electrospray source, the pressure of 
the nebulizing gas was 35 psig, the flow rate of the drying gas was 12.0 l.min' 1 , the 
temperature of the drying gas was 350°C, and the capillary voltage was brought to +/- 
3000 V. These experiments made it possible to determine the mass of the two products 
separated. These masses are, respectively, 370 g/mol for the product eluted first 
([M-H 2 OJ + =353 major product) and 368 g/mol for the second product ([M-H 2 0] + =351 
major product). 

In order to obtain the structure, the products mentioned above were isolated and 
purified under the following conditions: the mobile phase is a 70/30 (v/v) mixture of an 
aqueous 0.05% trifluoroacetic acid solution and acetonitrile. The chromatography is 
carried out in an isocratic system with a fixed flow rate of 1 ml/min. Under these 
conditions, the retention times of the 2 products are, respectively, 8 and 13.3 minutes. 
In addition, in this case, the sample prepared (cf. paragraph 20.1) is not diluted in water 
before injection of 10 jxl. 

The 2 products are recovered at the chromatographic column outlet and isolated 
under the following conditions: an Oasis HLB 1 cc 30 mg cartridge (Waters) is 
conditioned sequentially with 1 ml of acetonitrile, then 1 ml of water/acetonitrile 
(20v/80v) and 1 ml of 80/20 water/acetonitrile. The sample is then introduced and the 
cartridge washed successively with 1 ml of water/acetonitrile (95/5) and 1 ml of water / 
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deuterated acetonitrile (95/5), then eluted with 600 \xl of 40/60 water/deuterated 
acetonitrile. The solution recovered is then directly analyzed by NMR. 

The NMR spectra obtained for these two compounds are as follows: 
5 - First product eluted: Platenolide A: (spectrum 9646V) 

1H spectrum in CD3CN (chemical shifts in ppm): 0.90 (3H, t, J=6Hz), 0.93 (3H, d, 
J=5Hz), 1.27 (3H, d, J=5Hz), between 1.27 and 1.40 (3H, m), 1.51 (1H, m), 1.95 (1H, 
m), 2.12 (1H, m), 2.30 (1H, d, J=12Hz), 2.50 (1H, d, J=llHz), 2.58 (1H, dd, J=9 and 
12 Hz), 2.96 (1H, d, J=7Hz), 3.43 (3H, s), 3.70 (1H, d, J=9Hz), 3.86 (1H, d, J=7Hz), 
10 4.10 (1H, m), 5.08 (1H, m), 5.58 (1H, dt, J=3 and 12Hz), 5.70 (1H, dd, J=8 and 12 Hz), 
6.05 (1H, dd, J=9 and 12 Hz), 6.24 (1H, dd, J=9 and 12Hz). 

- Second product eluted: Platenolide B: (spectrum 9647V) 

1H spectrum in CD3CN (chemical shifts in ppm): 0.81 (3H, t, J=6Hz), 0.89 (1H, m), 
15 1.17 (3H, d, J=5Hz), 1.30 (4H, m), 1.47 (2H, m), 1.61 (1H, t, J=10Hz), 2.20 (1H, m), 
2.38 (1H, d, J=13Hz), 2.52 (1H, m), 2.58 (1H, m), 2.68 (1H, dd, J=8 and 13Hz), 3.10 
(1H, d, J=7Hz), 3.50 (3H, s), 3.61 (1H, d, J=8Hz), 3.82 (1H, d, J=7Hz), 5.09 (1H, m), 
6,20 (1H, m), 6.25 (1H, dd, J=9 and 12 Hz), 6.58 (1H, d, J=12 Hz), 7.19 (1H, dd, J=9 
andl2Hz). 

20 

These experiments thus made it possible to determine the structure of the these 
two compounds. The first product eluted is platenolide A and the second is platenolide 
B; the deduced structure of these two molecules is given in figure 36. 

25 It was also possible to determine, using the LC/MS technique combined with 

NMR, under conditions slightly different from those described above, but the setting up 
of which is well known to those skilled in the art, that the strain OS49.67 produces, in 
addition to platenolide A and B, a derivative of these two compounds. They are 
platenolide A+mycarose and platenolide B+mycarose (the structure of these two 
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compounds is given in figure 40). The results of analysis of the must of the strain 
OS49.67 are given in table 42. 



Table 42. Results of the LC/MS analysis of the must of the strain OS49.67 



Identity 


t ] 

(mg/l) 


Ion masses 


Max 

absorption 


Platenolide A 

Exact mass = 370 

Molecular formula 
= C20H34O6 


16.1 


[M + Na] + 
393.0 
[M + K] + 
408.9 

353.0 

[2M + Na] + 
763.2 


231 nm 


Platenolide B 

Exact mass = 368 

Molecular formula 
= C20H32O6 


1.4 


[M - H 2 0 + H] + 

351.0 

[M + Na] + 

391.0 

[M +K] + 

406.9 

[2M + Na] + 
759.1 


283 nm 


Platenolide A + 'mycarose' 
Exact mass = 514 
Molecular formula 

=C 2 7H 4 609 


4:27 


[ivi • in a j 

537.0 

[M +K] + 
553.0 


230 nm 


Platenolide B + 'mycarose' 

Exact mass = 512 

Molecular formula 
= C27H44O9 


ND 


[M + Na] + 
535.0 
[M +K] + 
550.9 

[PlatB - H 2 0 + H] + 
350.9 


284 nm 
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20.4 Analysis of the biosynthesis intermediates produced by the strain SPM509: 

The strain SPM509 in which the orfl4 gene is inactivated (orfl4::att3Qaac-) 
does not produce spiramycins (cf. examples 13, 15 and 16). A sample was prepared 
according to the method described above (cf. paragraph 20.1) and was analyzed by 
LC/MS as described above (cf. paragraphs 20.2 and 20.3). Analysis of the biosynthesis 
intermediates present in the culture supernatant of the strain SPM509 cultured in MPS 
medium showed that this strain produced only form B of platenolide ("platenolide B'\ 
cf. figure 36) and not form A ("platenolide A", cf. figure 36). 



EXAMPLE 21: Interruption of the orfl4 gene in a strain with a knockout in the 
or/3 gene (OS49.67) 

The product of the orfl4 gene is essential to spiramycin biosynthesis 
(cf. examples 13, 15 and 16: the strain SPM509 in which this gene is interrupted no 
longer produces spiramycins). Analysis of the biosynthesis intermediates present in the 
culture supernatant of the strain SPM509 cultured in MP5 medium showed that this 
strain produced only the form B of platenolide and not form A (cf. example 20). One of 
the hypotheses which may explain this observation is that the product of orfl4 is 
involved in the conversion of form B of platenolide to form A via an enzymatic 
reduction step. To test this hypothesis, the orfl4 gene was inactivated in a mutant no 
longer producing spiramycin, but producing forms A and B of platenolide. This is the 
case of the strain OS49.67 (cf. examples 6 and 20) in which the or/3 gene is inactivated 
by an in-phase deletion (Aor/3). To inactivate the or/14 gene in this strain, the plasmid 
pSPM509 was introduced by protoplast transformation of the strain OS49.67 (T. Kieser 
et al. 9 2000). The inactivation of the orfl4 gene has already been described in the case 
of the strain OSC2 (cf. example 13) and the same procedure was carried out for the 
inactivation of the orfl4 gene in the strain OS49.67. After transformation with the 
plasmid pSPM509, the clones were selected for their apramycin resistance. The 
apramycin-resistant clones were then subcultured respectively on medium with 
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apramycin and on medium with hygromycin. The clones resistant to apramycin (ApraR) 
and sensitive to hygromycin (HygS) are in principle those in which a double crossing 
over event has occurred and in which the orfl4 gene has been replaced with a copy of 
orfl4 interrupted with the att3Qaac- cassette. These clones were more particularly 
selected and the replacement of the wild-type copy of orfl4 with the copy interrupted 
with the cassette was verified by hybridization. Thus, the total DNA of the clones 
obtained was digested with several enzymes, separated on agarose gel, transferred onto 
a membrane and hybridized with a probe corresponding to the att3Qaac- cassette in 
order to verify that the gene replacement had indeed taken place. The verification of the 
genotype can also be carried out by any method known to those skilled in the art, and in 
particular by PCR using the appropriate oligonucleotides and sequencing of the PCR 
product. 

A clone exhibiting the expected characteristics (Aor/3, orfl4::att3£2aac-) was 
more particularly selected and named SPM510. It was in fact possible to verify, by 
virtue of the two hybridizations, that the attSQaac- cassette was indeed present in the 
genome of this clone and that the digestion profile expected in the case of a 
replacement, subsequent to a double recombination event, of the wild-type orfl4 gene 
with the copy interrupted with the att3Qaac- cassette in the genome of this clone is 
indeed obtained. 



EXAMPLE 22: Functional complementation of the interruption of the orfl4 gene 

22.1 Construction of the plasmid pSPM519: 

The or/14 gene was amplified by PCR using the following pair of 
oligonucleotides: EDR31: 5' CCCAAGCTTCTGCGCCCGCGGGCGTGAA 3' (SEQ 
ID No. 136) and EDR37: 5' GCTCTAGAACCGTGTAGCCGCGCCCCGG 3' (SEQ ID 
No. 137) and, as matrix, the chromosomal DNA of the strain OSC2. The 
oligonucleotides EDR31 and EDR37 carry, respectively, the Hindll! and Xbal 
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restriction site (sequence in bold). The 1.2 kb fragment thus obtained was cloned into 
the vector pGEM-T easy (marketed by the company Promega (Madison, Wisconsin, 
USA)), to give the plasmid pSPM515. This plasmid was then digested with the Hindlll 
and Xbal restriction enzymes. The 1.2 kb HindlU/Xbal insert obtained was cloned into 
5 the vector pUWL201 (cf. example 17.1) digested beforehand with the same enzymes. 
The plasmid thus obtained was named pSPM519. 

22.1 Transformation of the stra ins SPM509 and SPM510 with p lasmid 
. PSPM519: 

The plasmid pSPM519 was introduced into the strains SPM509 (cf. example 13) 
10 and SPM510 (cf. example 17) by protoplast transformation (T. Kieser et al, 2000). 
After transformation, the clones were selected for their thiostrepton resistance. The 
clones were then subcultured on a medium containing thiostrepton. 

The strain SPM509 is a spiramycin nonproducer strain (cf. examples 13, 15 and 
16 and figure 24). The spiramycin production of the strain SPM509 transformed with the 

1 5 vector pSPM5 1 9 (strain named SPM509 pSPM5 1 9) was analyzed by culturing this strain 
in MP5 medium in the presence of thiostrepton. The culture supematants were then 
analyzed by HPLC (cf. examples 16 and 17). The results of this analysis are given in table 
43, the data are expressed in mg per liter of supernatant. The results correspond to the 
total production of spiramycins (obtained by adding the production of spiramycin I, II and 

20 IH). It was observed that the presence of the vector pSPM519 in the strain SPM509 
restores the spiramycin production (cf. table 43). 

Table 43. Spiramycin production of the strain SPM509 transformed with the 
vector pSPM519, (results expressed in mg/1 of supernatant). 



Strain 


Spiramycin production 


SPM509 pSPM519 


58 



175 



The strain SPM510 transformed with the plasmid pSPM519 was named SPM510 
pSPM509. 

EXAMPLE 23: Functional complementation of the interruption of the or/3 gene 
5 by the tylB gene of S. fradiae 

23.1 Construction of the plasmid pOS49.52: 

The plasmid pOS49.52 corresponds to a plasmid which allows expression of the 
TylB protein in S. ambofaciens. In order to construct it, the coding sequence of the tylB 
gene of S. fradiae (Merson-Davies & Cundliffe, 1994, GenBank accession number: 

10 U08223 (sequence of the region), SFU08223 (DNA sequence) and AAA21342 (protein 
sequence)) was introduced in the plasmid pKC1218 (Bierman et al. y 1992, Kieser et aL, 
2000, a strain of E. coli containing this plasmid is accessible in particular from the ARS 
(NRRL) Agricultural Research Service Culture Collection) (Peoria, Illinois, USA), 
under the number B-14790). In addition, this coding sequence was placed under the 

15 control of the ermE* promoter (cf. above, in particular example 17.1, Bibb et aL, 1985, 
Bibb etal, 1994). 

23.1 Transformation of the strain OS49.67 with the plasmid pOS49.52: 

The strain OS49.67 in which the or/3 gene is inactivated by an in-phase deletion 
does not produce spiramycins (cf. examples 6 and 15). The plasmid pOS49.52 was 
20 introduced into the strain OS49.67 by protoplast transformation (T. Kieser et aL, 2000). 
After transformation, the clones were selected for their apramycin resistance. The 
clones were then subcultured on a medium containing apramycin. A clone was more 
particularly selected and was named OS49.67 pOS49.52. 

As was demonstrated above, the strain OS49.67 does not produce spiramycins 
25 (cf. examples 6 and 15). The spiramycin production of the strain OS49.67 transformed 
with the vector pOS49.52 was analyzed by the technique described in example 15. It 
was thus possible to demonstrate that this strain has a spiramycin producer phenotype. 
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Thus, the TylB protein allows functional complementation of the interruption of the 
or/3 gene. 



EXAMPLE 24: Improvement of spiramycin production by overexpression of the 
orf28c gene 

24.1 Construction of the plasmid pSPM75: 

The orf28c gene was amplified by PCR using a pair of oligonucleotides 
comprising a Hindlll restriction site or a BamHI restriction site. These primers have the 
following sequence: 

KF30: 5' AAGCTTGTGTGCCCGGTGTACCTGGGGAGC 3' (SEQ ID No. 138) 
with a Hindlll restriction site (which is shown in bold) 

KF31: 5' GGATCCCGCGACGGACACGACCGCCGCGCA 3' (SEQ ID No. 139) 
with the BamHI restriction site (which is shown in bold) 

The primers KF30 and KF31 carry, respectively, the Hindlll and BamHI restriction sites 
(sequence in bold). The pair of primers KF30 and KF31 makes it possible to amplify a 
DNA fragment approximately 1 .5 kb in size containing the or/2 8c gene, using as matrix 
the cosmid pSPM36 (cf. above). The 1.5 kb fragment thus obtained was cloned into the 
vector pGEM-T easy (marketed by the company Promega (Madison, Wisconsin, USA)) 
to give the plasmid pSPM74. The plasmid pSPM74 was then digested with the Hindlll 
and BamHI restriction enzymes and the approximately 1.5 kb HindllVBamHl insert 
obtained was subcloned into the vector pUWL201 (cf. example 17.1) digested 
beforehand with the same enzymes. The plasmid thus obtained was named pSPM75; it 
contains all the coding sequence of orf28c placed under the control of the ermE* 
promoter. 

24.2 Transform ation of the strain OSC2 with the plasmid pSPM75: 

The plasmid pSPM75 was introduced into the strains OSC2 by protoplast 
transformation (T. Kieser et al, 2000). After protoplast transformation, the clones were 
selected for their thiostrepton resistance. The clones were then subcultured on a 
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medium containing thiostrepton and the transformation with the plasmid was verified 
by plasmid extraction. Two clones were more particularly selected and named 
OSC2/pSPM75(l) and OSC2/pSPM75(2). 

A sample of the strain OSC2/pSPM75(2) was deposited with the Collection 
Nationale de Cultures de Microorganismes (CNCM) [National Collection of Cultures 
and Microorganisms] Pasteur Institute, 25, rue du Docteur Roux 75724 Paris Cedex 15, 
France, on October 6, 2003 under the registration number 1-3101. 

In order to test the effect of overexpression of the orf28c gene on spiramycin 
production, the spiramycin production of the OSC2/pSPM75(l) and OSC2/pSPM75(2) 
clones was tested by the technique described in example 15. Analysis of the spiramycin 
production of the strain OSC2 was also carried out in parallel for comparison. It was 
thus possible to observe that the presence of the plasmid pSPM75 significantly 
increases the spiramycin production of the strain OSC2. This demonstrates than 
overexpression of orf28c leads to an increase in spiramycin production and confirms its 
role as a regulator. 

The spiramycin production of the OSC2/pSPM75(l) and OSC2/pSPM75(2) 
clones was also analyzed by HPLC (in the same way as in example 17.2). Analysis of 
the spiramycin production of the strain OSC2 was also carried out in parallel for 
comparison. The results of this analysis are given in table 44, the data are expressed in 
mg per liter of supernatant. The results correspond to the total production of 
spiramycins (obtained by adding the production of spiramycin I, II and III). 

Table 44. Spiramycin production of the strains derived from OSC2 transformed 
with pSPM75, (results expressed in mg/1). 
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Strain 


Spiramycins 


OSC2 


50 


OSC2/pSPM75(l) 


120 


OSC2/pSPM75(2) 


155 



Thus, it is observed that the presence of the plasmid pSPM75 significantly 
increases the spiramycin production of the strain OSC2. This clearly demonstrates that 
overexpression of orf28c has a positive effect on the spiramycin production. 

5 

EXAMPLE 25: Analysis of the production of spiramycin biosynthesis 
intermediates by a strain which has been inactivated in the or/8 gene: 

The strain OS49.107, in which the or/8 gene is inactivated by insertion of the 
Ohyg cassette, does not produce spiramycins (cf. examples 7 and 15). The or/8 gene 

10 encodes a protein exhibiting relatively strong similarity with several aminotransferases 
and strongly suggests that the or/8 gene encodes a 4-aminotransferase responsible for 
the transamination reaction required for forosamine biosynthesis (cf. figure 6). It is 
therefore expected that spiramycin biosynthesis will be blocked at the forocidin stage 
(cf. figure 7). The strain OS49.107, which is a spiramycin non-producer, should 

15 therefore produce forocidin. 

A sample of supernatant of the strain OS49.107 was prepared according to the 
method described above (cf. example 16, without MIBK extraction) and was analyzed 
by LC/MS as described above (cf. paragraph 20.2 and 20.3). In SIM mode, the mass 
558 relating to the molecular ion of forocidin was selected and several peaks were 

20 detected. The presence of compounds of mass 558 is compatible with the hypothesis of 
orf8 having a role in forosamine synthesis. 
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EXAMPLE 26: Analysis of the production of spiramycin biosynthesis 
intermediates by a strain which has been inactivated in the orfl2 gene: 

The strain SPM507, in which the orfl2 gene is inactivated, does not produce 
spiramycins (cf. examples 11 and 15). The orfl2 gene is thought to encode a 3,4- 
5 dehydratase responsible for the dehydration reaction required for forosamine 
biosynthesis (cf. figure 6). It is therefore expected that the spiramycin biosynthesis will 
be blocked at the forocidin stage (cf. figure 7). The strain SPM507, which is a 
spiramycin non-producing strain, should therefore produce forocidin. 

A sample of supernatant of the strain SPM507 was prepared according to the 

10 method described above (cf. example 16, without MIBK extraction) and was analyzed 
by LC/MS as described above (cf. paragraphs 20.2 and 20.3). Under these conditions, 
the forocidin retention time is approximately 12.9 minutes. In SIM mode, the mass 558 
relating to the molecular ion [M+H] + of forocidin was selected and a peak was detected. 
The presence of a compound at 558 makes it possible to validate the hypothesis that the 

15 product of orfl 2 has a role in spiramycin biosynthesis. 

However, the forocidin is present in a relatively low amount and, under these 
conditions, a product which absorbs at 238 nm was more particularly observed 
(retention time of 17.1 min). The LC/MS analysis made it possible to determine the 
mass of this compound, which is 744.3 g/mol ([M+H] + = 744.3 major product). 

20 In order to obtain the structure, the products mentioned above were isolated and 

purified under the conditions described above (cf. paragraph 20.1). The organic phase 
(MIBK) is then recovered and evaporated. The dry extract is taken up with water and 
extracted with heptane. The aqueous solution is then extracted by binding to an Oasis 
HLB 1 g cartridge (Waters SAS, St-Quentin en-Yvelines, France). The compound is 

25 recovered by elution with a 30/70 water/acetonitrile mixture. This solution is then 
injected (100 jil) onto the analytical column and the fractions are recovered on an Oasis 
HLB 1 cc 30 mg cartridge (Waters). Before use, the Oasis HLB 1 cc 30 mg cartridges 
(Waters) are conditioned sequentially with acetonitrile, and then a water/acetonitrile 
mixture (20v/80v) and an 80/20 water/acetonitrile mixture. 
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The Oasis HLB 1 cc 30 mg cartridge (Waters) is then washed successively with 
1 ml of water/acetonitrile (95/5) and 1 ml of water/deuterated acetonitrile (95/5), and 
then eluted with 600 pi of 40/60 water/deuterated acetonitrile. The recovered solution is 
then directly analyzed by NMR. 

5 The NMR spectrum obtained for this compound is as follows (19312V NMR 

spectrum): 

1H spectrum in CD3CN/D20 (chemical shifts in ppm): 0.92 (3H, d, J=6Hz), 1.10 (1H, 
unresolved peak), 1.14 (3H, s), 1.17 (3H, d, J=6Hz), 1.22 (3H, d, J=6Hz), 1.25 (3H, d, 
J=6Hz), 1.40 (1H, unresolved peak), 1.75 (1H, dd, J=12 and 2Hz), 1.81 (1H, unresolved 

10 peak), 1.90 (1H, d, J=12Hz), 2.05 (1H, unresolved peak), 2.12 (3H, s), 2.15 (1H, 
unresolved peak), 2.35 (2H, unresolved peak), 2.45 (6H, broad s), 2.53 (1H, unresolved 
peak), 2.64 (1H, dd, J=12 and 9Hz), 2.80 (1H, dd, J=9 and 16Hz), 2.95 (1H, d, J=8Hz), 
3.23 (2H, unresolved peak), 3.34 (1H, d, J=7Hz), 3.45 (3H, s), 3.49 (1H, unresolved 
peak), 3.93 (1H, dd, J=7 and 3Hz), 4.08 (1H, unresolved peak), 4.37 (1H, d, J=6Hz), 

15 4.88 (1H, unresolved peak), 5.05 (2H, unresolved peak), 5.65 (2H, unresolved peak), 
6.08 (1H, dd, J=8 and 12Hz), 6.40 (1H, dd, J=12 and 9Hz), 9.60 (1H, s). 

These experiments thus made it possible to determine the structure of this 
compound, this structure is given in figure 38. 

20 EXAMPLE 27: Analysis of the production of spiramycin biosynthesis 
intermediates by a strain which has been inactivated in the or/5* gene: 

The strain SPM501 has the genotype or/6*::att]Qhyg+. By virtue of the polar 
effect of the insertion of the attlQhyg+ cassette into the orf 6* gene, it was possible to 
determine that the orf5* gene is essential to the spiramycin biosynthesis pathway. 
25 Specifically, insertion of the excisable cassette into the coding portion of the orf6* gene 
leads to complete arrest of spiramycin production by virtue of a polar effect on the 
expression of the orft* gene. However, once the inserted cassette has been excised (and 
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therefore when only the or/6* gene is inactivated, cf. examples 14 and 15), production 
of spiramycin I is restored. This demonstrates that the or/5* gene is essential to 
spiramycin biosynthesis since its inactivation leads to a complete arrest of spiramycin 
production. 

The or/5* gene encodes a protein exhibiting relatively strong similarity with 
several O-methyltransferases. The orf5* gene is thought to be an O-methyltransferase 
involved in platenolide biosynthesis. To verify this hypothesis, LC/MS and NMR 
analytical experiments were carried out on a strain of S. ambofaciens of genotype 
orf6*::attlQhyg+, obtained from a strain which overproduces spiramycins. 

A sample of the supernatant of this strain was prepared according to the method 
described above (cf. example 16, without MIBK extraction) and was analyzed by 
LC/MS as described above (cf. paragraph 20.2 and 20.3). However, the column used is 
an X-Terra column (Waters SAS, St-Quentin en-Yvelines, France), and the cone 
voltage of the spectrometer is set at 380V so as to obtain fermentation of the compound 
analyzed. Under these conditions, a product for which the retention time is 
approximately 13.1 minutes is observed. The mass spectrum of this compound has an 
appearance similar to that of spiramycin I, but the molecular ion is 829. The difference 
in mass of 14 compared with the mass of spiramycin can be explained by the absence of 
methyl group on the oxygen borne by carbon No. 4 of the lactone ring (the structure of 
this compound is given in figure 39). The presence of a compound at 829 makes it 
possible to validate the hypothesis that or/5* has a role in spiramycin biosynthesis. 
Using a microbiological test carried out on a sensitive strain of M. luteus 
(cf. example 15 and figure 18), it was demonstrated that the intermediate molecule 
(spiramycin minus methyl group, the structure of which is given in figure 39) produced 
by the strain or/6* ::Oatthyg+ is much less active (by a factor of 10) than the 
spiramycin of origin with the methyl group at position 4. 
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EXAMPLE 28 : Construction of new "excisable cassettes": 

New excisable cassettes were constructed. These cassettes are very similar to the 
excisable cassettes already described in example 9. The main difference between the 
former cassettes and the new cassettes is the absence, in the latter, of the sequences 
corresonding to the ends of the Q interposon, which sequences contain a transcription 
terminator originating from the T4 phage. 

In the cassettes without terminator, the gene which confers the resistance to an 
antibiotic is flanked by the attR and attL sequences which allow the excision. The 
resistance gene is the aac(3)IV gene which encodes an acetyltransferase which confers 
apramycin resistance. This gene is present in the Oaac cassette (GenBank accession 
number: X99313, Blondelet-Rouault, M.H. et al., 1997) and was amplified by PCR 
using, as matrix, the plasmid pOSK1102 (cf. above) and, as primers, the 
oligonucleotides KF42 and KF43 each containing the HindW. restriction site (in bold) 
(AAGCTT) in the 5' position. 

KF42: 5'-AAGCTTGTACGGCCCACAGAATGATGTCAC-3' 
(SEQIDNo. 153) and 

KF43 : 5'-AAGCTTCGACTACCTTGGTGATCTCGCCTT-3' 
(SEQIDNo. 154). ' 

The PCR product obtained, of approximately 1 kb, was cloned into the E. coli 
vector pGEMT Easy, producing the plasmid pSPM83. 

The vector pSPM83 was digested with the Hindlll restriction enzyme. The 
Hindm-Hindm fragment of the insert was isolated by purification from a 0.8% agarose 
gel, and then cloned into the Hindlll site located between the attL and attR sequences of 
the various plasmids carrying the various possible excisable cassettes (cf. example 9 
and figure 27) so as to replace the Hindlll fragment corresponding to Qacc with the 
Hindlll fragment corresponding to the aac gene alone. This made it possible to obtain 
the attlaac, att2aac and att3aac cassettes (depending on the desired phase, 
cf. example 9). Depending on the orientation of the aac gene relative to the attL and 
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attR sequences, attJaac+, attlaac-, att2aac+, att2aac, att3aac+ and att3aac- are 
distinguished (according to the same conventions as those adopted in example 9). 



EXAMPLE 29: Construction of a strain of S. ambofaciens with a knockout in the 
orf28c gene: 

The orf28c gene was inactivated using the excisable cassette technique. The 
excisable cassette att3aac+ (cf. example 28) was amplified by PCR using, as matrix, 
the plasmid pSPMlOl (the plasmid pSPMlOl is a plasmid derived from the vector 
pGP704Not (Chaveroche et aL, 2000) (Miller VL & Mekalanos JJ, 1988) in which the 
att3aac+ cassette has been cloned as an EcoRV fragment into the unique EcoKV site of 
pGP704Not) and using the following primers: 
KF32: 

5'CAACCGCTTGAGCTGCTCCATCAACTGCTGGGCCGAGG TATCGCGCGCG 
CTTCGTTC GGGACGAA 3' (SEQ ID No. 155) 
and KF33: 

5 ' TGGGTCCCGCCGCGCGGCACGACTTCG ACTCGCTCGTC TATCTGCCTCTT 
CGTCCCGA AGCAACT 3' (SEQ ID No. 156) 

The 39 nucleotides located at the 5' end of these oligonucleotides contain a 
sequence corresponding to a sequence in the orJ28c gene and the 26 nucleotides located 
in the most 3' position (shown in bold and underlined above) correspond to the 
sequence of one of the ends of the excisable cassette att3aac+ 

The PCR product thus obtained was used to transform the hyper-recombinant E. coli 
strain DY330 (Yu et aL, 2000) (this strain contains the exo, bet and gam genes of the 
lambda phage, integrated into its chromosome, these genes are expressed at 42°C, it 
was used in place of the E. coli strain KS272 (Chaveroche et aL, 2000)) containing the 
cosmid pSPM36. Thus, the bacteria were transformed by electroporation with this PCR 
product and the clones were selected for their resistance to apramycin. The cosmids of 
the clones obtained were extracted and digested with the BamHl restriction enzyme 
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with the aim of verifying that the digestion profile obtained corresponded to the profile 
expected if insertion of the cassette (att3aac+) into the orJ28c gene had taken place, i.e. 
if there had indeed been homologous recombination between the ends of the PCR 
product and the target gene. The construct can also be verified by any method known to 
those skilled in the art, and in particular by PCR using the appropriate oligonucleotides 
and sequencing of the PCR product. A clone for which the cosmid has the expected 
profile was selected and the corresponding cosmid was called pSPM107. This cosmid is 
a derivative of pSPM36, in which orf28c is interrupted by the att3aac+ cassette. 
Insertion of the cassette is accompanied by a deletion in the orf28c gene, the 
interruption begins at the 28th codon of orf28c. After the cassette, there remains the last 
137 codons of orf28c. 



The cosmid pSPM107 was, in a first step, introduced into the E. coli strain DH5a, 
and then into the Streptomyces ambofaciens strain OSC2 by protoplast transformation. 
After transformation, the clones were selected for their resistance to apramycin. The 
apramycin-resistant clones were then subcultured respectively on medium with 
apramycin (antibiotic B) and on medium with puromycin (antibiotic A) (cf. figure 9). 
The clones resistant to apramycin (ApraR) and sensitive to puromycin (PuroS) are, in 
principle, those in which a double crossing over event has occurred and which possess 
the orf28c gene interrupted by the att3aac+ cassette. These clones were more 
particularly selected and the replacement of the wild-type copy of orf28c with the copy 
interrupted by the cassette was verified by hybridization. Thus, the total DNA of the 
clones obtained was digested with several enzymes, separated on agarose gel, 
transferred onto a membrane and hybridized with a probe corresponding to the 
att3aac+ cassette so as to verify the presence of the cassette at the expected locus in the 
genomic DNA of the clones obtained. The genotype can also be verified by any method 
known to those skilled in the art, and in particular by PCR using the appropriate 
oligonucleotides and sequencing of the PCR product. 

A clone exhibiting the expected characteristics (orJ28c::att3aac+) was more 
particularly selected and called SPM107. This clone therefore has the genotype: 
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orf28c::att3aac+ and was called SPM107. In view of the orientation of the genes 
(cf. figure 3), it is pointless to excise the cassette to study the effect of the inactivation 
of orf28c. The fact that or/29 is oriented in the opposite direction to orf28c shows that 
these genes are not cotranscribed. The use of an excisable cassette makes it possible, on 
5 the other hand, to have the possibility of being rid of the selection marker at any time, in 
particular by transformation with the plasmid pOSV508. 

In order to test the effect of turning off the orf28c gene on spiramycin 
production, the spiramycin production of the strain SPM107 was tested by the 
technique described in example 15. It was thus possible to demonstrate that this strain 
10 has a spiramycin non-producing phenotype. This demonstrates that the orf28c gene is a 
gene essential to spiramycin biosynthesis in S. ambofaciens. 

EXAMPLE 30: Construction of a strain of S. ambofaciens with knockout in the 
or/31 gene: 

15 

The or/31 gene was inactivated using the excisable cassette technique. The 
excisable cassette att3aac+ was amplified by PCR using, as matrix, the plasmid 
pSPMlOl, and the oligonucleotides EDR71 and EDR72. 
EDR71: 

20 5' CGTCATCGACGTGCGGGGAAGACAGAGGTGATACCG ATGATCGCGCGC 
GCTTCGTTCGGGACGAA 3' (SEQ ID No. 157) 

EDR72: 

5 ' GCCAGCACCTCGTCCAGCTGCTCGACGGAACTCACCCCC ATCTGCCTCT 
25 TCGTCCCGAAGCAACT 3 ' (SEQ ID No. 1 58) 

The 39 nucleotides located at the 5' end of these oligonucleotides contain a 
sequence corresponding to a sequence in the orJ31 gene and the 26 nucleotides located 
in the most 3* position (shown in bold and underlined above) correspond to the 
30 sequence of one of the ends of the excisable cassette att3aac+. 
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The PCR product thus obtained was used to transform the E. coli strain KS272 
containing the plasmid pKOBEG and the cosmid pSPM36, as described by Chaveroche 
et al. (Chaveroche et al., 2000) (cf. figure 12 for the principle, the plasmid pOS49.99 
should be replaced with the cosmid pSPM36 and the plasmid obtained is no longer 
pSPM17 but pSPM543). Thus, the bacteria were transformed with this PCR product by 
electroporation and the clones were selected for their resistance to apramycin. The 
cosmids of the clones obtained were extracted and digested with several restriction 
enzymes with the aim of verifying that the digestion profile obtained corresponds to the 
profile expected if insertion of the cassette (att3aac+) into the orJ31 gene has taken 
place, i.e. if there has indeed been homologous recombination between the ends of the 
PCR product and the target gene. The construct can also be verified by any method 
known to those skilled in the art, and in particular by PCR using the appropriate 
oligonucleotides and sequencing of the PCR product. A clone for which the cosmid has 
the expected profile was selected and the corresponding cosmid was called pSPM543. 
This cosmid is a derivative of pSPM36, in which or/31 is interrupted by the att3aac+ 
cassette (cf. figure 12). The insertion of the cassette is accompanied by a deletion in the 
or/31 gene, the interruption begins at the thirty-sixth codon of or/31. After the cassette, 
there remain the last 33 codons of orf31. 

The cosmid pSPM543 was introduced into the Streptomyces ambofaciens strain 
OSC2 (cf. above) by protoplast transformation (Kieser, T et al., 2000). After 
transformation, the clones were selected for their resistance to apramycin. The 
apramycin-resistant clones were then subcultured respectively on medium with 
apramycin (antibiotic B) and on medium with puromycin (antibiotic A) (cf. figure 9). 
The clones resistant to apramycin (ApraR) and sensitive to puromycin (PuroS) are, in 
principle, those in which a double crossing over event has occurred and which possess 
the or/31 gene interrupted by the att3aac+ cassette. These clones were more 
particularly selected and the replacement of the wild-type copy of or/31 with the copy 
interrupted by the cassette was verified by hybridization. Thus, the total DNA of the 
clones obtained was digested with several enzymes, separated on agarose gel, 
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transferred onto membrane and hybridized with a probe corresponding to the att3aac+ 
cassette so as to verify the presence of the cassette at the expected locus in the genomic 
DNA of the clones obtained. A second hybridization was carried out using, as probe, a 
DNA fragment obtained by PCR and corresponding to a very large portion of the 
coding sequence of the or/3 J gene. 

The genotype can also be verified by any method known to those skilled in the 
art, and in particular by PCR using the appropriate oligonucleotides and sequencing of 
the PCR product. 

A clone exhibiting the expected characteristics (or/31 ::att3aac+) was more 
particularly selected and called SPM543. It was in fact possible to verify, by virtue of 
the two hybridizations, that the att3aac+ cassette was indeed present in the genome of 
this clone and that the digestion profile expected in the case of a replacement, 
subsequent to a double recombination event, of the wild-type gene with the copy 
interrupted by the att3aac+ cassette in the genome of this clone was indeed obtained. 
This clone therefore has the genotype: or/31::att3aac+ and was called SPM543. In 
view of the orientation of the genes (cf. figure 3), it is pointless to excise the cassette to 
study the effect of the inactivation of or/31. The fact that or/32c is oriented in the 
opposite direction to or/31 shows tfiat these genes are not cotranscribed. The use of an 
excisable cassette makes it possible, on the other hand, to have the possibility of being 
rid of the selection marker at any time, in particular by transformation with the plasmid 
pOSV508. 

In order to test the effect of turning off the or/3 J gene on spiramycin production, 
the spiramycin production of the strain SPM543 was tested by the technique described 
in example 15. It was thus possible to demonstrate that this strain has a spiramycin non- 
producing phenotype. This demonstrates that the or/31 gene is a gene essential to 
spiramycin biosynthesis in S. ambofaciens. 
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EXAMPLE 31: Construction of a strain of S. ambofatiens with a knockout in the 
or/32c gene: 

The orf32c gene was inactivated using the excisable cassette technique. The 
excisable cassette att3aac+ was amplified by PCR using, as matrix, the plasmid 
pSPMlOl and using the following primers: 
KF52: 

5 1 GATCCGCCAGCCTCACGTCACGCCGCGCCGCCTCCCTGACATCGCGCGC 
GCTTCGT TCGGGACGA A 3' (SEQ ID No. 159). 
and KF53: 

5' GAGGCGGACGTCGGTACGCGGTGGGAGCCGGAGTTCGACA ATCTGCCTC 
TTCGTCCC GAAGCAACT 3' (SEQ ID No. 160). 

The 40 nucleotides located at the 5' end of these oligonucleotides contain a 
sequence corresponding to a sequence in the orj32c gene and the 26 nucleotides located 
in the most 3' position (shown in bold and underlined above) correspond to the 
sequence of one of the ends of the excisable cassette att3aac+. 

The PCR product thus obtained was used to transform the hyper-recombinant 
E.coli strain DY330 (Yu et al:, 2000) containing the cosmid pSPM36. Thus, the 
bacteria were transformed with this PCR product by electroporation and the clones were 
selected for their resistance to apramycin. The cosmids of the clones obtained were 
extracted and digested with the BamHl restriction enzymes with the aim of verifying 
that the digestion profile obtained corresponds to the expected profile if insertion of the 
cassette (att3aac+) into the orJ32c gene has occurred, i.e. if there has indeed been 
homologous recombination between the ends of the PCR product and the target gene. 
The construct can also be verified by any method known to those skilled in the art, and 
in particular by PCR using the appropriate oligonucleotides and sequencing of the PCR 
product. A clone for which the cosmid has the expected profile was selected and the 
corresponding cosmid was called pSPM106. This cosmid is a derivative of pSPM36, in 
which orf32c is interrupted by the att3aac+ cassette. The insertion of the cassette is 
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accompanied by deletion in the orJ32c gene, the interruption begins at the 1 12th codon 
of or/3 2c. After the cassette, there remains the last 91 codons of or/3 2c. 

The cosmid pSPM106 was, in a first step, introduced into the E. coli strain DH5ct, 
and then into the Streptomyces ambofaciens strain OSC2 by transformation. After 
transformation, the clones are selected for their resistance to apramycin. The 
apramycin-resistant clones are then subcultured respectively on medium with 
apramycin (antibiotic B) and on medium with puromycin (antibiotic A) (cf. figure 9). 
The clones resistant to apramycin (ApraR) and sensitive to puromycin (PuroS) are, in 
principle, those in which a double crossing over event has occurred and which possess 
the orf32c gene interrupted by the att3aac+ cassette. These clones were more 
particularly selected and the replacement of the wild-type copy of orf32c with the copy 
interrupted by the cassette was verified by hybridization. Thus, the total DNA of the 
clones obtained was digested with several enzymes, separated on agarose gel, 
transferred onto membrane and hybridized with a probe corresponding to the att3aac+ 
cassette so as to verify the presence of the cassette in the genomic DNA of the clones 
obtained. The genotype can also be verified by any method known to those skilled in 
the art, and in particular by PCR using the appropriate oligonucleotides and sequencing 
of the PCR product. 

A clone exhibiting the expected characteristics (orJ32c::att3aac+) was more 
particularly selected. This clone therefore has the genotype: orf32c::att3aac+ and was 
called SPM106. In view of the orientation of the genes (cf. figure 3), it is pointless to 
excise the cassette to study the effect of the inactivation of the orj32c. The fact that 
or/33 is oriented in the opposite direction to orJ32c shows that these genes are not 
cotranscribed. The use of an excisable cassette makes it possible, on the other hand, to 
have the possibility of being rid of the selection marker at any time, in particular by 
transformation with the plasmid pOSV508. 

In order to test the effect of turning off the orf32c gene on spiramycin 
production, the spiramycin production of the strain SPM106 was tested by the 
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technique described in example 15. It was thus possible to demonstrate that this strain 
has a spiramycin-producing phenotype. This demonstrates that the or/32c gene is not a 
gene essential to spiramycin biosynthesis in S. ambofaciens. 
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List of the constructs described in the present application 



List of abbreviations: Am: Ampicillin; Hyg: Hygromycin; Sp: Spiramycin; 
Ts: Thiostrepton; Cm: Chloramphenicol. Kn: Kanamycin, Apra: apramycin. 



Construct 
name 


Selection 
marker 


Main characteristics 


Reference 


pWE15 


Am 




(Wahl, et aL, 1987) 


pWEDl 


Am 


pWE15 in which a 4.1 kb Hpah 
Hpal fragment has been deleted 


(Gourmelen et ai, 
1998) 


pOJ260 


Apra 


Conjugative, nonreplicative in 
Streptomyces 


(Bierman etal, 1992) 


pHP45 Qhyg 


Hyg 


Qhyg cassette in pHP45. 


(Blondelet-Rouault et 
aL, 1997) 


pKC505 


Apra 


Cosmid 


(Richardson MA et 
a!., 1987) 


pIJ486 


Ts 


Streptomyces multicopy 
replicative plasmid 


(Warded/., 1986) 


pOSint3 


Am 


ptrc-xis-int in pTrc99A 


(Raynal eta!., 1998) 


pWHM3 


Am, Ts 


E. coli/Streptomyces replicative 
shuttle vector 


(Vara et aL, 1989) 


pKOBEG 


Cm 




(Chaveroche et aL, 
2000) 


pGP704Not 


Am 




(Chaveroche et aL, 
2000) 


pMBL18 


Am 




(Nakano etal, 1995) 


pGEM-T 
Easy 


Am 


E, coli vector for cloning PCR 
products 


Mezei etal, 1994 


pOS49.1 


Am 


pWEDl with insert at thetfawHI 
site 


Example 2 
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pOS49.11 


Am 


Sad fragment of pOS49. 1 in 
puC19. 


Example 2 




Un 


ATioI fragment of pOS49. 1 1 in 
pBC SK+ 


Example 2 


pOS49.l4 


Cm , Hyg 


pOSC49.12 with the or/3 gene 
interrupted with the Qhyg cassette 


Example 2 


pOS49.l6 


Apra, Hyg 


Insert of pOS49.14 in pOJ260 


Example 2 


pOS49.28 


Cm 


3.7 kb Bamm-Pstl fragment of 
pOS49.1 inpBC SK+ 


Example 3 


pOS44.1 


Apra, Sp 


pKC505 containing in insert 
imparting spiramycin resistance in 
5. griseofuscus 


(Pernodetefa/., 1999) 


„ /~\ O A A ^ 

pOS44.2 


Ts, Sp 


1.8 kb Sau3A\ fragment of 
pOS44.1 inpIJ486 


Example 3 


^/^O AAA 

pOS44.4. 


Am 


Insert of pOS44.2 in pUC19 


Example 3 


pSPM5 


Am 


pWEDl withiS*. ambofaciens 
DNA insert at the BamHl site 


Example 3 


pSPM7 


Am 


pWED 1 with & ambofaciens 
DNA insert at the BamHl site 


Example 3 


pOSK1205 


Hyg 


pBK-CMV in which hyg replaces 
neo 


Example 5 


pOS49.67 


Apra 


EcoRl-Sacl fragment of pOS49.1, 
comprising an internal deletion of 
D\jH nucieoiiaes, in pL/JZOU 


Example 6 


pOS49.88 


Am 


3.7 kb Pstl-EcoRl fragment of 
pOS49.1 inpUC19 


Example 7 


pOS49.106 


Am 


p049.88 with hyg in or/8 (hyg and 
or/8 in the same orientation) 


Example 7 
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pOS49.l20 


Am 


pOS49.88 with hyg in or/8 (hyg 
and or/5 in opposite orientations) 


Example 7 


pOS49.l07 


Apra, Hyg 


Insert of pOS49.106 in pOJ260 


Example 7 


pOS49.32 


Am, Kn 


1.5 kb fragment within or/70, in 
pCR2.1-TOPO 


Example 8 


pOS49.43 


Am, Kn 


pOS49.32 with hyg in or/10 (hyg 
and orfl 0 in the same orientation) 


Example 8 


pOS49.44 


Am, Kn 


pOS49.32 with hyg in or/70 (hyg 
and orflO in opposite orientations) 


Example 8 


pOS49.50 


Apra, Hyg 


Insert of pO§49.43 in pOJ260 


Example 8 


pWHM3Hyg 


Am, Hyg 


pWHM3 in which tsr is replaced 
with hyg 


Example 10 


pOSV508 


Am, Ts 


ptrc-xis-int in pWHM3 


Example 9 


pattlQhyg+ 


Cm, Hyg 


attlQhyg+ cassette in pBC SK+ in 
which the Hindlll site has been 
deleted 


Example 9 


patt3Qaac- 


Cm, Gn 


att3fiaac- cassette in pBC SK+ in 
which the Hindlll site has been 
deleted 


Example 9 


pOSV5lO 


Am, Hyg 


pro pra-Amh in pWHM3Hyg 


Example 10 


pOS49.99 


Am 


4.5 kb EcoRl-BamHl fragment of 
pSPM5 in pUC19 


Example 10 




/vm, Apra 


pOF704Not containing the 
att3Qaac- cassette 


Example 10 


pSPM17 


Am, Apra 


pOS49.99 in which or/2 is 
interrupted with the alt3Qaac- 
cassette 


Example 10 
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pSPM21 


Hyg, Apra 


pOSK1205 containing the EcoRI- 
Xbal insert of pSPM17 (in which 
or/2 is interrupted with the 
att3Qaac- cassette) 


Example 10 


pSPM502 


Am 


15.1 kb BgHI fragment of pSPM7 
in pMBL18 


Example 1 1 


pSPM504 


Hyg 


Insert of pSPM502 in pOSK1205 


Example 1 1 


pSPM507 


Hyg, Apra 


pSPM504 in which orfl2 is 
interrupted with the att3Qaac- 
cassette 


Example 1 1 


pSPM508 


Hyg, Apra 


pSPM504 in which orfl3c is 
interrupted with the att3Qaac- 
cassette 


Example 12 


pSPM509 


Hyg, Apra 


pSPM504 in which orfl4 is 
interrupted with the att3Qaac- 
cassette 


Example 13 


pBXLllll 


Am 


1.11 kb fragment containing or/6* 
amplified by PCR from pSPM7, 
in the vector pGEM-T Easy 


Example 14 


pBXL1112 


Am, Hyg 


pBXLl 1 1 1 into which the 
attlQhyg+ cassette has been 
introduced after deletion of 120 
bp in the coding sequence of the 
orf 6* gene 


Example 14 


pBXL1113 


Apra, Hyg 


3.7 kb Pstl insert of pBXLl 1 12 in 
pOJ260 


Example 14 


pSPM520 


Am 


PCR fragment amplified by the 
oligonucleotides EDR39-EDR42 
in pGEM-T Easy 


Example 17 
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pSPM521 


Am 


PCR fragment amplified by the 
oligonucleotides EDR40-EDR42 
in pGEM-T Easy 


Example 17 


pSPM522 


Am 


PCR fragment amplified by the 
oligonucleotides EDR41-EDR42 
in pGEM-T Easy 


Example 17 


pUWL201 


Am , Ts 




(Doumith et al. 9 2000) 


pSPM523 


Am, Ts 


Hindlll-BamHl fragment of the 
insert of the plasmid pSPM520 in 
the vector pUWL201 


Example 17 


pSPM524 


Am, Ts 


Hindlll-BamHl fragment of the 
insert of the plasmid pSPM521 in 
the vector pUWL201 


Example 17 


pSPM525 


Am, Ts 


Hindlll-BamHl fragment of the 
insert of the plasmid pSPM522 in 
the vector pUWL201 


Example 17 


pSPM527 


Am 


pSPM521 with reading frame shift 
at thecal site 


Example 17 


pSPM528 


Am , Ts 


Hindlll-BamHl fragment of the 
insert of the plasmid pSPM527 in 
the vector pUWL201 


Example 17 


pVF 10.4 






(Vara et aL, 1985; 
Lacallee/a/., 1989) 


pPM803 


Ts 




(Mazodier,P. etal. 9 
1989) 


pGEM-T- 
pac-oriT 


Am 


pac-oriT cassette (amplified by 
PCR from pVF 10.4 and pPM803) 
in pGEM-T Easy 


Example 1 8 
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1 dWED2 


A TV! 


pac-oriT cassette obtained from 
pGEM-T-pac-oriT inserted into 
pWhDl 


Example 1 8 | 


pSPM34 


Am 


pWED2 with insert at the BamHl 
site 


Example 19 


pSPM35 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 1 


pSPM36 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 


pSPM37 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 ) 


pSPM38 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 j 


pSPM39 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 


pSPM40 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 


pSPM41 


Am 


p WED? with insert at the BamHI 
site 


Example 19 


pSPM42 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 I 


pSPM43 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 


pSPM44 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 j 


pSPM45 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 


pSPM47 


Am 


pWED2 with insert at the BamHI 
site 


Example 19 ] 
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pSPM48 


Am 


pWED2 with insert at the BamHl 
site 


Example 19 


pSPM50 


Am 


pWED2 with insert at the BamHl 
site 


Example 19 


pSPM51 


Am 


pWED2 with insert at the BamHl 
site 


Example 1 9 


pSPM52 


Am 


pWED2 with insert at the BamHl 
site 


Example 1 9 


pSPM53 


Am 


pWED2 with insert at the BamHl 
site 


Example 19 


pSPM55 


Am 


pWED2 with insert at the BamHl 
site 


Example 1 9 


pSPM56 


Am 


pWED2 with insert at the BamHl 
site 


Example 19 


pSPM58 


Kn 


Approximately 6 kb PsthPstl 
fragment of the insert of pSPM36 
in pBK-CMV 


Example 19 


pSPM72 


Kn 


Approximately 10 kb Stul-Stul 
fragment of the insert of pSPM36 
cloned into pBK-CMV 


Example 19 


pSPM73 


Cm 


EcoRl-Hin&lll fragment of the 
insert of pSPM72 in pBC-SK+ 


Example 19 


pSPM515 


Am 


PCR fragment amplified by 
EDR31-EDR37 in pGEM-T easy 


Example 22 


pSPM519 


Am, Ts 


HindllVXbal insert of pSPM515 
in pUWL201 


Example 22 


pOS49.52 


Apra 


Coding sequence of tylB under the 
control of the ermE* promoter in 
the plasmid pKC1218 


Example 23 
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pSPM74 


Am 


PCR fragment amplified by 
KF30-KF3 1 in pGEM-T easy 


Example 24 


pSPM75 


Am, Ts 


HindllVBamHl insert of pSPM74 
inpUWL201 


Example 24 


pSPM79 


Kn 


Approximately 2.5 kb Pstl-Pstl 
fragment of the insert of pSPM36 
in pBK-CMV 


Example 19 


pSPM83 


Am 


PCR fragment amplified by 
KF42-KF43 in pGEM-T easy 


Example 28 


pSPM107 


Am, Apra 


pSPM36 in which orf28c is 
interrupted with the att3aac+ 
cassette 


Example 29 


pSPM543 


Am, Apra 


pSPM36 in which or/3 J is 
interrupted with the att3aac+ 
cassette 


Example 30 


pSPM106 


Am, Apra 


pSPM36 in which orf32c is 
interrupted with the att3aac+ 
cassette 


Example 3 1 
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Depositing ofbiologjcaj material 

The following organisms were deposited with the Collection Nationale de Cultures 
de Microorganismes [National Collection of Cultures and Microorganisms] (CNCM), 25 
rue du Docteur Roux, 75724 Paris Cedex 15, France, on July 10, 2002, according to the 
5 provisions of the Treaty of Budapest. 

- Strain OSC2 under the registration number 1-2908. 

- Strain SPM501 under the registration number 1-2909. 

- Strain SPM502 under the registration number 1-29 1 0. 

- Strain SPM507 under the registration number 1-291 1 . 
10 - Strain SPM508 under the registration number 1-2912. 

- Strain SPM509 under the registration number 1-291 3. 

- Strain SPM21 under the registration number 1-2914. 

- Strain SPM22 under the registration number 1-291 5. 

- Strain OS49.67 under the registration number 1-291 6. 

15 - Strain OS49. 1 07 under the registration number 1-291 7. 

Escherichia Coli strain DH5a containing the plasmid pOS44;4, under the 
registration number 1-291 8. 

The following organisms were deposited with the Collection Nationale de 
Cultures de Microorganismes (CNCM), 25 rue du Docteur Roux, 75724 Paris Cedex 
20 15, France, on February 26, 2003, according to the provisions of the Treaty of 
Budapest. 



Strain SPM502 pSPM525 under the registration number 1-2977. 
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The following organisms were deposited with the Collection Nationale de 
Cultures de Microorganismes (CNCM), Pasteur Institute, 25 rue du Docteur Roux, 
75724 Paris Cedex 15, France, on October 6, 2003, according to the provisions of the 
Treaty of Budapest. 



5 



Strain OSC2/pSPM75(2) under the registration number 1-3101. 
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All the publications and patents cited are incorporated into the present application by 
way of reference. 
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